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Description 
Related Applications 

[0001] For U.S. National Stage purposes, thfs application is a continuation-in-part of U.S. application Serial No. 
08/744,481 , filed November 6, 1 996. to Koster. entitled "DN A DIAGNOSTICS BASED ON MASS SPECTROMETRY". 
This application is also a continuation-in-part of U.S. application Serial Nos, 08/744,590, 06/746,036, 08/746,055, 
08/786,988, 08/787,639, 08/933,792 and U,S. application Serial No. atty dkt. no. 7352-2001 B, filed October 8, 1997, 
which ts a continuation-in-part of U.S. application Nos. 08/746,055, 08/786,988 and 08/787,639. For international pur- 
poses, benefit of priority is claimed to each of these applications. 

[0002] This application is related to U.S. Patent Application Serial No. 08/61 7,256 filed on March 18, 1 996. which is 
a continuation-in-part of U.S. application Serial No. 08/406.199, filed March 17, 1996, now U.S. Patent No. 5,605,798, 
and is also related U.S. Patent Nos. 5,547,835 and 5,622,824. 

[0003] Where pernnitted the subject matter of each of the above-noted patent applications and the patent is herein 
incorporated in its entirety. 

BACKGROUND OF THE INVENTION 

Detection of mutations 

[0004] The genetic information of all living organisms Ce.g. , animais, plants and microorganisms) is encoded in de- 
oxyribonucleic acid (DNA). In humans, the complete genome is contains of about 100,000 genes located on 24 chro- 
mosomes (The Human Genome, T Strachan, BIOS Scientific Publishers, 1 992). Each gene codes for a specific protein, 
which after its expression via transcription and translation, fuifills a specific biochemical function within a living cell. 
Changes in a DNA sequence are known as mutations and can result in proteins with altered or in some cases even 
lost biochemical activities; this in turn can cause genetic disease. Mutations include nucleotide deletions, insertions or 
aiteralions (i.e. point mutations). Point mutations can be either ''missense", resulting in a change in the amino acid 
sequence of a protein or "nonsense" coding for a stop codon and thereby leading to a truncated protein. 
[0005] More than 3000 genetic diseases are cun^ently known (Human Genome Mutations, D. N. Cooper and M. 
Krawczak, BIOS Publishers, 1993), including hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD), Hunt- 
ington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF). fn addition to mutated genes, which result in 
genetic disease, certain birth defects are the result of chromosomal abnormalities such as Trisomy 21 (Down's Syn- 
drome), Trisomy 13 (Patau Syndrome), Trisomy 1 8 (Edward's Syndrome), Monosomy X (Turner's Syndrome) and other 
sex chromosome aneuploidies such as Klienfelter's Syndrome (XXY), Further, there is growing evidence that certain 
DNA sequences may predispose an Individual to any of a number of diseases such as diabetes, arteriosclerosis, 
obesity, various autoimmune diseases and cancer (e.g. , colorectal, breast, ovarian, lung). 

[0006] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are 
different from the sequences contained in the host cell. Therefore, infectious organisms can also be detected and 
identified based on their specific DNA sequences. 

[0007] Since the sequence of about 1 6 nucleotides Is specific on statistical grounds even for the size of the human 
genome, relatively short nucleic acid sequences can be used to detect nonnal and defective genes In higher organisms 
and to detect infectious microorganisms ( e.g. , bacteria, fungi, protists and yeast) and viruses. DNA sequences can 
even serve as a fingerprint for detection of different individuals within the same species (see, Thompson, J. S. and M. 
W. Thompson, eds., Genetics in Medicine , W.B. Saunders Co., Phfladelphia, PA (1991)). 

[0008] Several methods for detecting DNA are cunrently being used. For example, nucleic acid sequences can be 
identified by comparing the mobility of an amplified nucleic acid fragment with a known standard by gel electrophoresis, 
or by hybridization with a probe, which is complementary to the sequence to be identified, Identfffcation, however, can 
only be accomplished if the nucleic acid fragment is labeled with a sensitive reporter function (e.g. , radioactive 
(^P, ^^3)^ fluorescent or chemi luminescent). Radioactive labels can be hazardous and the signals they produce decay 
over time. Non-lsotopic labels (e.g. , fluorescent) suffer from a lack of sensitivity and fading of the signal when high 
intensity lasers are being used. Additionally, perfomriing labeling, electrophoresis and subsequent detection are labo- 
rious, time-consuming and error-prone procedures. Electrophoresis fs particularly error-prone, since the size or the 
molecular weight of the nucleic acid cannot be directly correlated to the mobility in the gel matrix. It is known that 
sequence specific effects, secondary structure and Interactions with the gel matrix are causing artefacts. 

Use of mass spectrometry for detection and Identification of nucleic acJds 

[0009] Mass spectrometry provides a means of "weighing" individual molecules by ionizing the molecules in vaccuo 
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and making them "fly" by volatilization. Under the influence of combmations of electric and magnetic fields, the ions 
follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular 
weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and characteri- 
zation of organic molecules by the determination of the mass of the parent molecular ion. In addition, by arranging 
cotlisions of this parent molecular ion with other particles ( e.g. , argon atoms), the molecular ion is fragmented forming 
secondary ions by the so-calJed collision induced dissociation (CID). The fragmentation pattern/pathway very often 
allows the derivation of detailed structural infonnation. Many applications of mass spectrometrlc methods are known 
in the art, particularly in biosciences (see, e^, Methods tn Enzymot. , Vol. 1 93r "Mass Spectrometry" (J, A. McCloskey 
editor), 1990, Academic Press, New York). 

[0010] Because of the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, 
accuracy of mass measurements, detailed structural infonnation by CID in conjunction with an MS/MS configuration 
and speed, as wet! as on-line data transfer to a computer, there has been interest fn the use of mass spectrometry for 
the structural analysis of nucleic acids. Recent reviews summarizing this field include K.H. Schram, "Mass Spectrometry 
of Nucleic Acid Components. Biomedical Applications of Mass Spectrometry" 34, 203-287 (1 990); and R F. Grain, "Mass 
Spectrometrlc Techniques in Nucleic Acid Research/' Mass Spectrometry Reviews 9, 505-554 (1990); see, also U.S. 
Patent No. 5,547,335 and U.S. Patent No. 5,622,824). 

[0011] Nucleic acids, however, are very polar biopolymers that are very difficult to volatiMze. Consequently, mass 
spectrometrlc detection has been limited to low molecular weight synthetic oligonucleotides for confirming an already 
known oligonucleotide sequence by determining the mass of the parent molecular ion, or alternatively, confinning a 
known sequence through the generation of secondary ions (fragment ions) via CID m an MS/MS configuration using, 
in particular, for the ionization and volatilization, the method of fast atomic bombardment (FAB mass spectrometry) or 
plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis of protected dimeric 
blocks for chemical synthesis of olfgodeoxynucleotides has been described (Koster et al (1987) Biomed. Environ. 
Mass Spectrometry 14, 111-116). 

[001 2] Other lonization/desorptlon techniques include electrosp ray/ion -spray (ES) and matrix-assisted laser desorp- 
tlon/ionlzation (MALDI). ES mass spectrometry has been introduced by Fenn et al. (J. Phys. Chem. 88 :4451 -59 (1 984); 
PCT Application No. WO 90/14148) and current applications are summarized in review articles (see, e^. Smith etaL 
(1990) Anal. Chem . 62:882-89 and Ardney (1992) Electrospnay Mass Spectrometry^ Spectroscopy Europe 4:10-18). 
The molecular weights of atetradecanucleotlde (see.Covey etaL (1988) The "Determination of Protein, Oligonucleotide 
and Peptide Molecular Weights by lonspray Mass Spectrometry," Rapid Commun. in Mass Spectrometry 2:249-256), 
and of a 21 -mer (Methods in Enzymol. . 193 , "Mass Spectrometry" (McCloskey, editor), p. 425, 1990, Academic Press, 
fvJew York) have been published. As a mass analyzer, a quadrupole is most frequently used. Because of the presence 
of multiple ion peaks thatall could be usedforthe mass calculation, the detemnlnation of molecular weights in femtomole 
amounts of sample is very accurate. 

[0013] MALDI mass spectrometry, In contrast, can be attractive when a ttme-of-fllght (TOP) configuration (see, 
Hillenkamp et al. (1 990) pp 49-60 in "Matrix Assisted UV-Laser Desorption/tonization: A New Approach to Mass Spec- 
trometry of Large Biomolecuies," Biological Mass Spectrometry . Burlingame and McCloskey editors, Elsevier Science 
Publishers, Amsterdam) is used as a mass analyzer. Since, in most cases, no multiple molecular ion peaks are produced 
with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry. 
[0014] Although DNA molecules up to a molecular weight of 410,000 daJtons have been desorbed and volatilized 
(Williams etal., "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen Aqueous Solutions," 
Science 246, 1585-87 (1989)), this technique had only shown very low resolution (otigothymfdyllc acids up to 18 nucle- 
otides, Huth-Fehre et al Rapid Commun. in Mass Spectrom. ^ 6, 209-13 (1 992); DNA fragments up to 500 nucleotides 
in length K. Tang etaL, Rapid Commun. in Mass Spectrom. . 8, 727-730 (1 994); and a double-stranded DNA of 28 base 
pairs (Williams etai, "Time-of- Flight Mass Spectrometry of Nucleic Acids by Laser Ablation and Ionization from a Frozen 
Aqueous Matrix," Rapid Commun. in Mass Spectrom. , 4, 348-351 (1 990)). Japanese Patent No, 59-131909 describes 
an instrument, which detects nucleic acid fragments separated either by electrophoresis, liquid chromatography or high 
speed gel filtration, Mass spectrometric detection is achieved by incorporating into the nucleic acids, atoms, such as S, 
Br, I or Ag, Au, Pt, Os, Hg, that nomnaily do not occur in DNA. 

[0015] Co-owned U.S. Patent No. 5,622,824 describes methods for DNA sequencing based on mass spectrometric 
detection. To achieve this, the DNA is by means of protection, specificity of enzymatic activity, or immobilization, uni- 
laterally degraded in a stepwise manner via exonuclease digestion and the nucleotides or derivatives detected by mass 
spectrometry. Prior to the enzymatic degradation, sets of ordered deletions that span a cloned DNA fragment can be 
created. In this manner, mass-modified nucleotides can be incorporated using a combination of exonuclease and DNfiJ 
RNA polymerase. This permits either multiplex mass spectrometric detection, or modulation of the activity of the exo- 
nuclease so as to synchronize the degradative process. Co-owned U.S. Patent Nos. 5.605,798 and 5,547,835 provide 
methods for detecting a particular nucleic acid sequence in a biological sample. Depending on the sequence to be 
detected, the processes can be used, for example, in methods of diagnosis. These methods, while broadly useful and 
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applicable to numerous embodiments, represent the first disclosure of such applications and can be improved upon. 
[0016] Therefore, It Is an object herein to provided improved methods for sequencing and detecttng DNA molecules 
fn biological samples. It Is also an object herein to provided Improved methods for diagnosis of genetic diseases, 
predispositions to cetain diseases, cancers, and infections. 

SUMMARY OF THE INVENTION 

£0017] Methods of diagnosis by detecting and/or detenning sequences of nucleic acids that are based on mass 
spectrometry are provided herein. Methods are provided for detecting double-stranded DNA, detecting mutations and 
other diagnostic markers using MS analysis. In particular, methods for diagnosing neuroblastoma, detecting heredity 
relationships, HLAcompatibifity, genetic fingerprinting, detecting teleromase activity for cancer diagnosis are provided. 
[0018J In certain embodiments the DNA is immobilized on a solid support either directly or via a linker and/or bead. 
Three permutions of the methods for DNA detection in which immobitized DNA is used are exemplified. These include: 
(1) immobilization of a template; hybridization of the primer; extension of the primer, or extension of the primer (singie 
ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (sequencing); (2) 
immoblfization of a primer; hybridization of a single stranded template; and extension of the primer, or extension of the 
primer (single ddNTP) for sequencfng or diagnostics or extension of the primer and Endonuclease degradation (se- 
quencing); (3) immobflizatlon of the primer; hybridization of a double stranded template; extension of the primer, or 
extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease 
degradation (sequencing). 

[0019] In certain embodiments the DNA is immobilized on the support via a selectively cleavabfe linker Selectively 
cJeavable linkers include, buta are not limited to photocleavable linkers, chemically cJeavable linkers and an enzymat- 
ically (such as a restriction site (nucleic acid linker), a protease site) cleavable linkers. Inclusion of a selectively cleavable 
linker expands the capabflities of the MALDI-TOF MS analysis because it allows for all of the permutations of immo- 
bijizatjon of DNA for MALDI-TOF MS, the DNA linkage to the support through the 3'- or 5^-end of a nucleic acid; allows 
the amplified DNA or the target primerto be extended by DNA synthesis; and further allows for the mass of the extended 
product (or degraded product via exonuclease degradation) to be of a size that is appropriate for MALDI-TOF MS 
analysis (i.e,, the isolated or synthesized DNA can be large and a small primer or a large primer sequence can be used 
and a small restriction fragment of a gene or single strand thereof hybridized thereto). 

[0020] In a preferred embodiment, the sefectively cleavable linker is a chemical or photocleavable linker that is 
cleaved during the ionizing step of mass spectrometry. Exemplary linkers include linkers containing, a disutfide group, 
a ieuvinyl group, an acid-laJDlie trityi group and a hydrophobic trftyl group. Jn other embodiments, the enzymatically 
cleavable tinker can be a nucleic acid that is an RNA nucfeotide or that encodes a restriction endonuclease site. Other 
enzymatically cleavable linkers include linkers that contain a pyrophosphate group, an arginrne-arginine group and a 
lysine-lysine group. Other linkers are exemplified herein. 

[0021] Methods for sequencing long fragments of DNA are provided. To perform such sequencing, specific base 
terminated fragments are generated from a target nucleic acid. The analysis of fragments rather than the full length 
nucleic acid shifts the mass of the ions to be determined into a lower mass range, which is generally more amenable 
to mass spectometric detection. For example, the shift to smaller masses Increases mass resolution, mass accuracy 
and, in particular, the sensitivity for detection. Hybridization events and the actual molecular weights of the fragments 
as determined by mass spectrometry provide sequence infonnation (e.g. , the presence and/or identity of a mutation). 
In a preferred embodiment, the fragments are captured on a solid support prior to hybridization and/or mass spectrom- 
etry detection. In another preferred embodiment, the fragments generated are ordered to provide the sequence of the 
larger nucfeic acid. 

[0022J One preferred method for generating base specifically terminated fragments from a nucleic acid is effected 
by contacting an appropriate amount of a target nucleic acid with an appropriate amount of a specific endonuclease, 
thereby resulting in partial or complete digestion of the target nucleic acid. Endonucleases will typically degrade a 
sequence into pieces of no more than about 50-70 nucfeotides. even if the reaction is not run to full completion. In a 
preferred embodiment, the nucleic acid is a ribonucleic acid and the endonuclease is a ribonuclease (RNase) selected 
from among: the G-speclfic RNase T^, the A-specifIc RNase U2, the A/U specific RNase PhyM, U/C specific RNase 
A, C specific chicken liver RNase (RNase CL3) or crisavitin. In another preferred embodiment, the endonuclease is a 
restriction enzyme that cleaves at least one site contained within the target nucleic acid. Another preferred method for 
generating base specifically temninated fragments includes performing a combined amplification and base-specific 
termination reaction (e^, using an appropriate amount of a first DNA polymerase, which has a relatively low affinity 
towards the chain-terminating nucleotides resulting in an exponential amplification of the target; and a polymerase with 
a relatively high affinity for the chain terminating nucleotide resulting In base-specific termination of the polymerization. 
Inclusion of a tag at the 5* and/or 3' end of a target nucleic acid can facilitates the ordering of fragments. 
[0023] Methods for detemnlning the sequence of an unknown nucleic acid in which the 5' and/or 3' end of the target 
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nucl9ic acid can include a tag are provfded. Inclusion of a non-natural tag on the 3' end is also useful for rulmg out or 
connpensating for the Influence of 3' heterogeneity, premature termination and nonspecific elongation. In a preferred 
embodiment, the tag is an affinity tag (e.g. , biotin or a nucleic acid that hybridizes to a capture nucleic acid). Most 
preferably the affinity tag faciiitates binding of the nucleic acid to a solid support. In another preferred embodiment, the 
tag is a mass marker (i.e. a marker of a mass that does not correspond to the mass of any of the four nucleotides), in 
a further embodiment, the tag is a natural tag, such as a poiyA tail or the natural 3' heterogeneity that can result, for 
example, from a transcription reaction. 

[0024] Methods of sequence analysis in which nucleic acids have been replicated from a nucleic acid molecule 
obtained from a biological sample are specifically digested using one or more nucleases (deoxyribonudeasesfor DNA, 
and ribonucleases for RNA) are provided. The fragments captured on a solid support carrying the corresponding com- 
plementary sequences. Hybridization events and the actual molecular weights of the captured target sequences provide 
infonnation on mutations in the gene. The array can be analyzed spot-by-spot using mass spectrometry. Further, the 
fragments generated can be ordered to provide the sequence of the larger target fragment, 

[0025] In another embodfment, at least one primer with a 3'-terminal base is hybridized to the target nucleic acid 
near a site where possible mutations are to be detected. An appropriate polymerase and a set of three nucleoside 
triphosphates (NTPs) and the fourth added as a terminator are reacted. The extension reaction products are measured 
by mass spectrometry and are indicative of the presence and the nature of a mutation. The set of three NTPs and one 
dd-NTP (or three NTPs and one S'-deoxy NTP), will be varied to be able to discriminate between several mutations 
(including compound heterozygotes) in the target nucleic acid sequence. 

[0026] Methods for detecting and diagnosing neoplasia/malagnancfes in a tissue or cell sample are provided. The 
methods rely on a telomeric repeat amplification protocol (TRAP) -MS assay and include the steps of: 

a) obtaining a tissue or a cell sample, such as a clinical isolate or culture of suspected cells; 

b) isolating/extracting/purifying telomerase from the sample; 

c) adding the telomerase extract to a composition containing a synthetic DNA primer, which is optionally immobi- 
lized, complementary to the telomeric repeat, and allfour dNTPs under conditions that result in telomerase specific 
extension of the synthetic DNA; 

d) amplifying the telomerase extended DNA products,, preferably using a primer that contains a "linker moiety", 
such as a moiety based on thiol chemistry or streptavidin; 

e) isolating linker-ampfified primers, such as by using a complementary binding partner immobilized on a solid 
support; 

f) optionally conditioning the DNA for crystal formation; and 

g) perfomnrng MS by lonrzlngA^oratizing the sample to detect the DNA product. 

Telomerase-specific extension fs indicative of neoplaisa/malfgnancy. 

This method can be used to detect ect specffic malignancies. The use of MS to detect the DNA product pemiits iden- 
tification the extended product, which is indicative of telomerase activity in the sample. If desired, the synthetic DNA 
can be in the form an array. 

[0027] Methods for detecting mutations are provided and the use thereof oncogenes and to thereby screen fortrans- 
formed ceils, which are indicative of neoplasia. Detection of mutations present fn oncogenes are indicative of trans- 
formation. This method includes the steps of: 

a) obtaining a biological sample; 

b) amplifying a portion of the selected proto-oncogene that includes a codon indicative of transformation, where 
one primer has a linker moiety for immobilization; 

c) immobilFzIng DNA via the linker moiety to a solid support, optionaKy in the form of an array; 

d) hybridizing a primercompiementary to the proto oncogene sequence that is upstream from the codon 

e) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primerto the next ddNTP location; 

f) ionizing/volatizing the sample; and 

g) detecting the mass of the extended DNA, whereby mass Indicates the presence of wild-type or mutant alleles. 
The presence of a mutant allele at the codon is diagnostic for neoplasia. 

In an exemplary embodiment, extension-MS analysis is used detect the presence of a mutated codon 634 in the ret- 
rovlms {RET)-proto oncogene. 

[0028] In another embodiment, methods for diagnosing diseases using reverse transcription and amplification of a 
gene expressed in transformed cells. In particular, a method for diagnosis of neuroblastoma using reverse transcriptase 
(RT)-MS of tyrosine hydroxylase, which is a catecholamine bicsynthetic enzyme that expressed in tumor cells, but not 
in nomial cells, such as normal bone marrow cells is provided. The method includes the steps of: 
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a) obtaining a tissue sample; 

b) isolating polyA RNAfrom the sample; 

c) preparing a cDNA library using reverse transcription; 

d) amplifing a cDNA product, or portion thereof, of the selected gene, where one oligo primer has a linker moiety; 

e) isolating the ampHfied product by immobilizing the DNA to solid support via the linker moiety; 

f) optionally conditioning the DNA: 

g) ionizing/volatizing sample and detecting the presence of a DNA peak that is indicative of expression of the 
selected gene gene. For example, expression of the tyrosine hydroxylase gene is Indicative of neuroblastoma. 

[0029] Also provided are methods of directly detecting a double-stranded nucleic acid using MALDl-TOF MS. These 
methods include the steps of: 

a) isolating a double stranded DNA of an appropriate size for MS via amplffication methods orfonned by hybridi- 
zation of single-stranded DNA fragment; 

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA in 
which the conditions mciude one or all of the following: preparing sampJes for analysis at reduced temperatures 
(J-Q- 4 • C), and using of higher DNA concentrations in the matrix to drive duplex formation 

c) ionizing/volatizing the sample of step b), where this step uses low acceleration voltage of the tons to assist in 
maintaining duplex DNA by, for example, adjusting laser powerto just above threshold Irradiation for ionization, and 

d) detecting the presence of the dsDNA of the appropriate mass. 

In preferred embodiments, the matrix Includes 3-hydroxypicolinic acid. The detected DNA can be indicative of a genetic 
disorder, genetic disease, genetic predisposition to a disease chromosomal abnormalities. In other embodiments, the 
mass of the double stranded DNA is indicative of the deletion, insertion, miitation. 

[0030] A method designated primer oligo base extension (PROBE) is provided. This method uses a single detection 
primer followed by an oligonucleotide extension step to give products, which can be readtly resolved by MALDl-TOF 
mass spectrometry, The products differ in length by a number of bases speciftc for a number of repeat units or for 
second site mutations within the repeated region. The method is exemplified using as a model system the AluVpA 
polymorphism in intron 5 of the interferon-a receptor gene located on human chromosome 21 , and the poly T tract of 
the spJice acceptor site of tntron 8 from the CFTR gene located on human chromosome 7. The method is advanta-^ 
geously used for example, for determining identity, identifying mutations, familial relationship, HLA compatability and 
other such markers ,using PROBE-MS analysis of microsatellite DNA. In a preferred embodiment, the method includes 
the steps of: 

a) obtaining a bioioglcal sample from two individuals; 

b) amplifying a region of DNA from each individual that contains two or more microsatellite DNA repeat sequences 

c) ionizing/volatizing the amplified DNA; 

d) detecting the presence of the amplified DNA and comparing the molecular weight of the amplified DNA. Different 
sizes are indicative of non-identity (Le. wild-type versus mutation), non-heredity or non-compatibility; similar size 
fragments indicate the possibility identity, of familial relationship, or HLA compatibility. 

[0031] More than one marker may be examined simulataneoulsy, primers with different linker moieties are used for 
immobilization. 

[0032] Another method loop-primer oligo base extension, designated LOOP-PROBE, for detection of mutations es- 
pecially predominant disease causing mutations or common polymorphisms is provided. In a particular embodiment, 
this method for detecting target nucleic acid in a sample, includes the steps of: 

a) amplifying a target nucleic acid sequence, such as p-globin, in a sample, using (i) a first primer whose 5'-end 
shares identity to a portion of the target DNA Immediately downstream from the targeted codon followed by a 
sequence that introduces a unique restriction endonuclease site, such as Cfol in the case of p-globin. Into the 
ampiicon and whose 3^-end primer is setf-complementary; and (ii) a second downstream primer that contains a 
tag, such as biotin, for immobilizing the DNA to a solid support, such as streptavidin beads; 

c) fmmobilizing the double-stranded amplified DNA to a solid support via the linker moiety; 

d) denaturing the immobilized DNA and isolating the non-immobilized DNA strand; 

e) annealing the intracomplementary sequences in the 3'-end of the Isolated non-immobllzed DNA strand, such 
that the 3'-end is extendable by a polymerase, which annealing can be performed, for example, by heating then 
and cooling to about 37* C, or other suitable method; 

f) extending the annealed DNA by adding DNA polymerase, 3 dNTPs/1 ddNTP, whereby the 3'-end of the DNA 
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strand is extended by the DNA polymerase to the position of the nextddNTP iocation {ie^, to the mutation iocatfon); 
g) cieaving the extended doubie stranded stem ioop DNA with the unique restriction endonuclease and removing 
the cleaved stem ioop DNA 

i) (optionaily adding a matrix) ionizing/voiatizing the extended product; and 

j) detecting the presence of the extended target nucleic acid, whereby the presence of a DNA fragment of a mass 
different from wild-type is indicative of a mutatEon at the target codon(s). 

This method eliminatesonespecffic reagent for mutation detection compared other methods of MS mutatfonal analyses, 
thereby simpJifying the process and rendering it amenable to automation. Also, the specific extended product that is 
analyzed is cleaved from the primer and is therefore shorter compared to the other methods. In addition, the anneaiing 
efficfency is higher compared to annealing of an added primer and should therefore generate more product. The process 
is compatible with multiplexing and various detection scJiemes (e^, singfe base extension, oligo base extension and 
sequencing). For example, the extension of the loop-primer can be used for generation of short diagnostic sequencing 
ladders within highly polymorphic regions to perfomi, for example, HLA typing or resistance as well as species typfng. 
[0033] In another emodiment, a methods of detecting a target nucleic acid in a biological sample using RNA ampli- 
fication is provided. In the method, the target is amplified the target nucleic acid, using a primer that shares a region 
complementary to the target sequence and upstream encodes a promoter, such as the T7 promoter. A DNA-dependent 
RNA polymerase and appropriate ribonucleotides are added to synthesize RNA, which is analyzed by MS. 
[0034] Improved methods of sequencing DNA using MS are provided. In these methods thermocyclfng for amplrfi- 
cation is used prior to MS analysis, thereby increasing the signai. 

[0035] Also provide are primers for use in MS analyses. In particular, primers, comprising all or, for longer oligonu- 
cleotides, at least about 20, preferably about 16, bases of any of the sequence of nucleotfdes sequences set forth in 
SEQ fD NOs. 1-22, 24. 27-38, 41-86, B9, 92. 95, 98, 101-110, 112-123, 126, 128, 129. and primers set forth rn SEQ 
ID Nos. 280-287. The primers are unlabeled, and optionally include a mass modifying moiety, which is preferably 
attached to the 5'end. 

[0036] Other features and advantages of the methods provided herein will be further described with reference to the 
following Figures, Detailed Description and Claims. 

BRIEF DESCRIPTJON OF THE FIGURES 

[0037] FIGURE 1 A is a diagram showing a process for performing mass spectrometric analysis on one target de- 
tection site (TDS) contained within a target nucleic actd molecule (T), which has been obtained from a biological sample. 
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen 
to specifically hybridize with a complementary sequence on the target nucleic acid molecule (T), known as the target 
capture site (TCS). The spacer (S) facilitates unhindered hybridization. A detector nucleic acid sequence (D), which 
is complementary to the TDS is then contacted with the TDS. Hybridization between D and the TDS can be detected 
by mass spectrometry. 

[0038] FIGU RE 1 B is a diagram showing a process for perfonning mass spectrometric analysis on at least one target 
detection site (here TDS 1 and TDS 2) via direct linkage to a solid support. The target sequence (T) containing the 
target detection site (TDS 1 and TDS 2) is immobilized to a solfd support via the fonnation of a reversible or irreversibie 
bond formed between an appropriate functionality (U) on the target nucleic acid molecule (T) and an appropnate func- 
tionality (L) on the solid support. Detector nucieic acid sequences (here D1 and D2), which are complementary to a 
target detection site (TDS 1 or TDS 2) are then contacted with the TDS. Hybridization between TDS 1 and D1 and/or 
TDS 2 and D2 can be detected and distinguished based on molecular weight differences. 

[0039] FIGURE 1 C is a diagram showing a process for detecting a wildtype (D^t) and/or a mutant (D^"*) sequence 
in a target (T) nucleic acid molecule. As in Figure 1 A, a specific capture sequence (C) is attached to a solid support 
(SS) via a spacer (S). In addition, the capture sequence is chosen to specifically interact with a complementary se- 
quence on the target sequence (T), the target capture site (TCS) to be detected through hybridization. If the target 
detection site (TDS) includes a mutation, X, etection sites can be distinguished from wildtype by mass spectrometry. 
Preferably, the detector nucleic acid molecule (D) is designed so that the mutation Is In the middle of the molecule and 
therefore would not lead to a stable hybrid if the wildtype detector oligonucleotide (D^t) is contacted with the target 
detector sequence, e^, as a controi. The mutation can also be detected ff the mutated detector oligonucleotide (D"i"t) 
with the matching base at the mutated position is used for hybridization. If a nucleic acid molecule obtained from a 
biological sample is heterozygous for the particular sequence (i.e. contain 0*^ and D^^wt), D** and D"^^^ will be bound 
to the app and D'""* to be detected simultaneously. 

[0040] FIGURE 2 is a diagram showing a process in which several mutations are simultaneously detected on one 
target sequence molecular weight differences between the detector oligonucleotides D1, D2 and D3 must be large 
enough so that simultaneous detection (multiplexing) is possible. This can be achieved either by the sequence itself 
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(compositfon or length) or by the introduction of mass-modifying functionalities M1-IVI3 into tiie detector oligonucJeotide. 
[0041] FIGURE 3 rs a diagram showing still another multiplex detection format, in this embodiment, differentiation 
is accompJIsiied by employing different specific capture sequences whicii are position-specificaliy immobilized on a 
fJat surface (e.g. , a 'chip array'). If different target sequences T1 -Tn are present, their target capture srtes TCS1 -TCSn 
will interact with complementary immobilized capture sequences C1-Cn. Detection is achieved by employing appro- 
priately mass differentiated detector oligonucleotides Dl-Dn, which are mass differemiated either by their sequences 
or by mass modifying functionalities M1-IVIn. 

[0042] FIGURE 4 is a diagram showing a format wherein a predesigned target capture site (TCS) is incorporated 
into the target sequence using nucleic acid (re., PGR) amplification. Only one strand is captured, the other is removed 
(e^, based on the interaction between biotin and streptavidin coated magnetic beads), if the biotin is attached to 
primer 1 the other strand can be appropriateiy marked by aTCS. Detection Is asdescrlbedabovethrough the Interaction 
of a specific detector oligonucleotide D with the con-esponding target detection site TDS via mass spectrometry 
[0043] FIGURE 5 is a diagram showing how amplification (here ligase chain reaction (LCR)) products can be prepared 
and detected by mass spectrometry. Mass differentiation can be achieved by the mass modifying functionalities (M1 
and M2) attached to primers (P1 and P4 respectively). Detection by mass spectrometry can be accomplished directly 
(i.e. without employing Immobfiization and target capturing sites (TCS)). Multiple LCR reactions can be performed fn 
parallel by providing an ordered array of capturing sequences (C). This fomnat allows separation of the ligation products 
and spot by spot identification via mass spectrometry or multiplexing tf mass differentiation Is sufficient. 
[0044] FiGURE 6A is a diagram showing mass spectrometric analysis of a nucleic acid molecule, which has been 
amplified by a transcription amplification procedure. An RNA sequence is captured via its TCS sequence, so that 
wildtype and mutated target detection sites can be detected as above by employing appropriate detector oligonucJe- 
otides (D). 

[0045] FIGURE 6B is a diagram showing multipJexing to detect two different (mutated) sites on the same RNA in a 
simultaneous fashion using mass-modified detector oligonucleotides M1-D1 and M2'D2. 

[0046] FIGURE 6C is a diagram of a different multiplexing procedure for detection of specific mutations by empJoying 
mass modified dideoxynucleoside or 3'-deoxynucl8oslde triphosphates and an RNA dependent DNA polymerase. Al- 
ternatively, DNA dependent RNA polymerase and ribonucleotide phosphates can be employed. This format allows for 
simultaneous detectfon of all four base possibilities at the site of a mutation (X). 

[0047] FIGURE 7A is a diagram showing a process for performing mass spectrometric anaiysis on one target de- 
tection site (TDS) contained within atarget nucleic acid molecule (T), which has been obtained from a biological sample. 
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen 
to specifically hybridize with a complementary sequence on T known as the target capture site (TCS). A nucleic acid 
moiecule that is complementary to a portion of the TDS is hybridized to the TDS 5' of the site of a mutation (X) within 
the TDS. The addition of a complete set of dideoxynucleosides or 3'-deoxynucleoside triphosphates (e.g. > pppAdd, 
pppTdd, pppCdd and pppGdd) and a DNA dependent DNA or RNA polymerase ailows forthe addition only of the one 
dideoxynucleoside or 3"-deoxynucleoside triphosphate that is complementary to X. 

[0048] FIGURE 7B is a diagram showing a process for performing mass spectrometric analysis to determine the 
presence of a mutation at a potential mutation site (M) within a nucleic acid molecule. This format allows for simulta- 
neous analysis of alleles (A) and (B) of a double stranded target nucleic acid molecule, so that a diagnosis of ho- 
mozygous normal, homozygous mutant or heterozygous can be provided. Allele A and B are each hybridized with 
complementary oilgonucleotides ((C) and (D) respectively), that hybridize to A and B within a region that includes M. 
Each heteroduplex is then contacted with a single strand specific endonuclease, so that a mismatch at M, indicating 
the presence of a mutation, results In the cleavage of (C) and/or (D), which can then be detected by mass spectrometry. 
[0049] FIGURE 8 Is a diagram showing how both strands of a target DNA can be prepared for detection using tran- 
scription vectors having two different promoters at opposite locations (e.g. , the SP6 and T7 promoter). This fonnat is 
particularly useful for detecting heterozygous target detections sites (TDS). Employing the SP6 or the T7 RNA polymer- 
ase both strands could be transcribed separately or simultaneously. The transcribed RNA molecules can be specifically 
captured and simultaneously detected using appropriately mass-differentiated detector oligonucleotides. This can be 
accomplished either directly in solution or by parallel processing of many target sequences on an ordered array of 
specifically immobilized capturing sequences. 

[0050] FIGURE 9 is a diagram showing how RNA prepared as described in Figures 6, 7 and B can be specifically 
digested using one or more ribonucleases and the fragments captured on a solid support carrying the corresponding 
complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences 
provide information on whether and where mutations in the gene are present The array can be analyzed spot by spot 
using mass spectrometry. DNA can be similarly digested using a cocktail of nucleases including restriction endonucle- 
ases. Mutations can be detected by different molecular weights of specific, Individual fragments compared to the mo- 
lecular weights of the wildtype fragments. 

[0051] FIGURE 1 0A shows UV spectra resulting from the experiment described in the following Example 1 . Panel 
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i) shows the absortoance of the 26-mer before hybriclization. Panel ii) shows the filtrate of the centiifugatlon after hy- 
bridization. Panel iij) shows the results after the first wash with 50 mlW ammonium citrate. Panel rv) shows the results 
after the second wash with 60 mM ammonium citrate. 

[0052] FIGURE 10B shows a mass spectrum resulting from the experiment described in the following Example 1 
after three washing/centrifugation steps. 

[0053] FIGURE IOC shows a mass spectrum resulting from the experiment described in the following Example 1 
showing the successful desorption of the hybridized 26-mer off of beads in accordance with the format depicted sche- 
maticaily in Figure 1 B. 

[0054] FIGURE 11 shows a mass spectrum resulting from the experiment described in the following Example 1 
showing the giving proof of an experiment as schematically depicted in FIGURE 1 B successful desorption of the hy- 
bridized 40-mer. The efficiency of detection suggests that fragments much longer than 40-mers can also be desorbed. 
Figure 12 shows a mass spectrum resulting from the experiment described in the following Example 2 showing the 
successful desorption and differentiation of an 18-mer and 19-mer by etectrospray mass spectrometry, the mixture 
(top), peaks resulting from 18-mer empiiasized (middle) and peaks resulting from 1 9-mer emphasized (bottom) 
[0055] FIGURE 13 is a graphic representation of the process for detecting the Cystic Fibrosis mutation AF60S as 
described in Example 3. 

[0056] FIGURE 14 is a mass spectrum of the DNA extension product of a AFSOS homozygous normal of Examples. 
[0057] FIGURE 15 is a mass spectrum of the DNA extension product of aAF508 heterozygous mutant of Examples. 
[0058] FIGU RE 1 6 is a mass spectrum of the DNA extension product of a AF50& homozygous normal of Example 3. 
[0059] FIGURE 1 7 is a mass spectrum of the DNA extension product of a AFS08 homozygous mutant of Example 3. 
[0060] FIGURE 1 8 is a mass spectrum of the DNA extension product of a AF508 heterozygous mutant of Example 3, 
[0061] FIGURE 1 9 is a graphic representation of various processes for performing apolipoprotein E genotyping of 
Example 4. 

[0062] FIGURE 20 shows the nucleic acid sequence of normal apolipoprotein E (encoded by the E3 allele, FIG, 20B) 
and other isotypes encoded by the E2 and E4 alleles (FiG. 20A). 

[0063] FIGURE 21 A shows a composite restriction pattern for various genotypes of apolipoprotein E using the Cfof 
restriction endonuclease. 

[0064] FIGURE 21 B shows the restriction pattern obtained in a 3.5% MetPhor Agarose Gel for various genotypes 
of apolipoprotein E. 

[0065] FIGURE 21 C shows the restriction pattern obtained In a 12% pofyacrylamide gel for various genotypes of 
apolipoprotein E. 

[0066] FIGURE 22A is a chart showing the molecular weights of the 91, 83, 72, 48 and 35 base pair fragments 
obtained by restriction enzyme cleavage of the E2, E3 and E4 alleles of apolfpoprotein E. 

[0067] FIGURE 22B tsthe mass spectrum of the restriction product of a homozygous E4 apolipoprotein E genotype. 
[0068] FIGURE 23A Is the mass spectrum of the restriction product of a homozygous E3 apolipoprotein E genotype. 
[0069] FIGURE 23B is the mass spectrum of the restriction product of a E3/E4 apolfpoprotein E genotype. 
[0070] FIGURE 24 is an autoradiograph of Example 5 of a 7.5% polyacrylamide gel in which 10% (Sjxl) of each 
amplified sample was loaded: sample M: pBR322 >4/u/digested; sample 1 : HBV positive in seroiogjcal analysis; sample 
2: also HBV positive; sample 3 : without serological analysis but with an increased level of transaminases, indicating 
liver disease; sample 4 : HBV negative containing HCV; sample 5 : HBV posit-) negative control; (+) positive control). 
Staining was done with ethidium bromide, 

[0071] FIGURE 25A is a mass spectrum of sample 1 , which is HBV positive. The signal at 20754 Da represents the 
HBV related amplification product (67 nucleotides, calculated mass: 20735 Da). The mass signal at 10390 Da repre- 
sents the [M+2H]2+ molecule ion (calculated: 1 0378 Da). 

[0072] FIGURE 25B is a mass spectrum of sample 3, which is HBV negative corresponding to nucleic acid (Le., 
PGR), serological and dot blot based assays. The amplified product Is generated only in trace amounts. Nevertheless 
it is unambiguously detected at 20751 Da (calculated mass: 20735 Da), The mass signal at 1 0397 Da represents the 
[M+2H]2+ molecule ion (calculated: 10376 Da). 

[0073] FIGURE 25C is a mass spectrum of sample 4, which is HBV negative, but HCV positive. No HBV specific 
signals were observed. 

[0074] FIGURE 26 shows a part off the E. cofi tad gens with binding sites of the complementary oligonucelotides 
used in the ligase chain reaction (LCR) of Example 6. Here the wildtype sequence is displayed. The mutant contains 
apoint mutation at bp 191 which is also the site of ligation (bold). The mutation is a G to T transition (Gto A, respectively). 
This leads to a T-G mismatch with oligo B (and A-C mismatch with oiigo C, respectively). 

[0075] FIGURE 27 is a 7.15% polyacrylamide gel of Example 6 stained with ethidium bromide. M: chain length 
standard (pUCI 9DNA, Msp\ digested). Lane 1 ; LCR with wildtype template. Lane 2: LCR with mutant template. Lane 
3: (control) LCR without template. The ligation product (50 bp) was only generBted in the positive reaction containing 
wildtype template. 


EP 1 164 203 A2 


[0076] FIGU RE 28 is an HPLC chromatogram of two pooled positive LCRs, 

[0077] FIGURE 29 shows an HPLC chromatogram the same conditions but mutant template were used. The smail 
signai of the ligation product is due to eithertemplate-free ligation of theeducts orto a ligation ata (G-T, A-C) mismatch. 
The 'false positive' signal is significantly lower than the signal of ligation product with wildtype template depicted in 
Figure 28. The analysis of ligation educts leads to 'double-peaks' because two of the oligonucleotides are 5'-phospho- 
rylated. 

[0078] FIGURE 30 In (b)the complex stgnal pattern obtained by MALDI-TOF-MS analysis of P/uDNA-ligase solution 
of Example 6 is depicted. In (a) a MALDI-TOF-spectrum of an unpurified LCR is shown. The mass stgnal 67569 Da 
probably represents the Pfu DNA iigase. 

[0079] FIGU RE 31 shows a MALDI-TOF spectrum of two pooled positive LCRs (a). The signal at 7523 Da represents 
unligated oligo A (calculated: 7521 Da) whereas the signal at 15449 Da represents the ligation product (cafculated: 
1 5450 Da). The signal at 3774 Da is the [M + 2H]2+ signal of ollgo A. The signals in the mass range lower than 2000 
Da are due to the matrix rons. The spectrum con-esponds to lane 1 in figure 27 and the chromatogram in figure 28. In 
(b) a spectrum of two pooled negative LCRs (mutant template) (s shown. The signal at 7517 Da represents oligo A 
(calculated: 7521 Da). 

[0080] FIGURE 32 shows a spectrum of two pooled control reactions (with salmon sperm DNA as template). The 
signals In the mass range around 2000 Da are due to Tween20, only oiigo A could be detected, as expected, 
[0081 ] FIGU RE 33 shows a spectrum of two pooled positive LCRs (a). The purification was done with a combination 
of ultrafiltration and streptavidin DynaBeads as described in the text. The signal at 15448 Da represents the ligatfcn 
product (calculated: 1 5450 Da). The signal at 7627 represents oligo A (calculated: 7521 Da). The signals at 3761 Da 
Is the [M+2H]£+ signal of oligo A, whereas the signal at 5140 Da is the signal of the ligation product. In (b) 

a spectrum of two pooled negative LCRs (without template) is shown. The signal at 7514 Da represents oligo A (cal- 
culated: 7521 Da), 

[0082] FIGURE 34 is a schematic presentation of the oligo base extension of the mutation detection primer as de- 
scribed in Example 7, using ddTTP (A) or ddCTP (B) in the reaction mix, respectively. The theoretical mass calculation 
is given in parenthesis. The sequence shown is part of the exon 1 0 of the CFTR gene that bears the most common 
cystic fibrosis mutation AF508 and more rare mutations AI507 as welf as iie506Ser 

[0083] FIGURE 35 Is a MALDI-TOF-MS spectrum recorded directly from precipitated oligo base extended primers 
for mutation detection. The spectrum in (A) and (B), respectively show the annealed primer (CF508) without further 
extension reaction. Panel C displays the MALDI-TOF spectrum of the wild type by using pppTdd in the extension 
reaction and D a heterozygotic extension products carrying the 506S mutation when using pppCdcf as terminator. 
Panels E and F show a heterozygote with AF508 mutation wfth pppTdd and pppCdd as terminators In the extension 
reaction. Panels G and H represent a homozygous AF50B mutatron with either pppTdd or pppCdd as tenTiinalors. The 
template of diagnosis is pointed out below each spectrum and the observed/expected molecular mass are written In 
parenthesis. 

[0084] FIGURE 36 shows the portion of the sequence of pRFd DNA, which was used as template for nucfejc acid 
amplification in Example 8 of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as 
the sequences of the 1 9-mer forward primer and the two 18-mer reverse primers. 

[0085] FIGURE 37 shows the portion of the nucleotide sequence of Mi 3mp1 8 RFI DMA, whtch was used in Example 
8 for nucleic acid amplification of □nmodified and 7-dGazapurine containing 103-mer nucleic acids. Also shown are 
nucleotfde sequences of the 1 7-mer primers used in the nucleic acid ampiffication reaction. 

[0086] FIGURE 38 shows the result of a polyacrylamide gel electrophoresis of amplified products described in Ex- 
ample 8 purified and concentrated for MALDI-TOF MS analysis. M: chain length marker, lane 1: 7-dea2apurine con- 
taining 99-mer amplified product, lane 2: unmodified 99-mer, lane 3: 7-deazapurine containing 103-mer and lane 4: 
unmodified 103-mer amplified product 

[0087] FIGURE 39: an autoradiogram of polyacrylamide gef electrophoresis of nucleic acid (Le., PGR) reactions 
carried out with 5'-[32p]-iabeled primers 1 and 4. Lanes 1 and 2: unmodified and 7-deazapurine modified 103-mer 
amplified product (53321 and 23520 counts), lanes 3 and 4; unmodified and 7'deazap urine modified 200-mer (71123 
and 39582 counts) and lanes 5 and 6: unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts). 
[0088] FIGURE 40 a) MALDI-TOF mass spectrum of the unmodified 103-mer amplified products (sum of twetve 
single shot spectra). The mean value of the masses calculated for the two single strands (31 768 u and 31 759 u) is 
31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer amplified product 
(sum of three single shot spectra). The mean value of the masses calculated for the two single strands (31 727 u and 
31719 u) is 31723 u. Mass resolution: 67. 

[0089] FIGU RE 41: a) MALDI-TOF mass spectrum of the unmodified 99-mer amplified product (sum of twenty single 
shot spectra). Values of the masses calculated forthe two single strands: 30261 u and 30794 u. b) MALDI-TOF mass 
spectrum of 7-deazapurine containing 99-mer amplified product (sum of twelve single shot spectra). Values of the 
masses calculated forthe two single strands: 30224 u and 30750 u. 
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[0090] FIGURE 42: a) MALDI-TOF mass spectrum of the unmodified 200-mer amplified product (sum of 30 single 
shot spectra). The mean value of the masses calculated for the two single strands (61873 u and 61595 u) is 61734 u. 
Mass resolution: 28. b) MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer amplified product (sum of 
30 single shot spectra). The mean value of the masses calculated for the two single strands (61 772 u and 61 714 u) is 
61643 u. Mass resolution: 39. 

[0091] FIGURE 43: a) MALDI-TOF mass spectrum of 7-deazapurine containing 100-mer amplified product with ri- 
bomodifled primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is 
30812 u. b) MALDI-TOF mass spectrum of the amplified product after hydrolytic primer-cleavage. The mean value of 
the masses calculated for the two single strands (25104 u and 25229 u) is 25167 u. The mean value of the deaved 
primers (5437 u and 591 8 ir) is 5677 u. 

[0092] FIGU RE 44 A-D shows the MALDI-TOF mass spectrum of the four sequencing ladders obtained from a39-mer 
template (SEQ ID No. 23), which was immobilized to streptavidin beads via a 3' biotinylation. A 14-mer primer (SEQ 
ID NO. 24) was used in the sequencing according to Example 9. 

[0093] FIGURE 45 shows a MALDI-TOF mass spectrum of a solid phase sequencing of a 7e-mer template (SEQ ID 
No. 25), which was immobilized to streptavidin beads via a 3" biotirrylation. A 18-mer primer (SEQ ID No. 26) and 
ddGTP were used in the sequencing. 

[0094] FIGURE 46 shows a scheme in which duplex DNA probes with single-stranded overhang capture specific 
DNA templates and also serve as primers for solid phase sequencing, 

[0095] FIGURE 47 A-D shows MALDi-TOF mass spectra obtained from a sequencing reaction using 5' fluorescent 
labeled 23-mer (SEQ ID No. 29) annealed to a 3' biotinylated IB-mer (SEQ ID No. 30), leaving a 5-base overhang, 
which captured a 1 5-mer template (SEQ ID No. 31) as described in Example 9. 

[0096] FIGURE 48 shows a stacking fluorogram of the same products obtained from the reaction described in FIG- 
URE 47, but run on a conventional DNA sequencer. 

[0097] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder using cycle sequencing as de- 
scribed in Example 1 generated from a biological amplified product as template and a 12mer (5'-TQC ACC TGA CTC- 
3' (SEQ ID NO. 34)) sequencing primer. The peaks resulting from depurinations and peaks which are not related to 
the sequence are marked by an asterisk. MALDI-TOF MS measurements were taken on a reflectron TOF MS. A.) 
Sequencing ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; C.) Sequencing ladder stopped 
with ddGTP; D.) Sequencing ladder stopped with ddTTP. 

[0098] FIGURE 50 shows a schematic representation of the sequencing ladder generated in Fig. 49 with the corre- 
sponding calculated molecuiar masses up to 40 bases after the primer. For the calculation, the following masses were 
used; 3581 .4Da for the primer, 31 2.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and 328.2 Da for 
7-deaza-dGTP. 

[0099] FIGURE 51 shows the sequence of the ampfified 209 bp amplified product within the p-giobin gene, which 
was used as a template for sequencing. The sequences of the appropriate amplification primer and the location of the 
12mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 bases after 
the primer. In a wildtype sequence this T would be replaced by an A. 

[0100] FIGURE 52 shows a sequence which is part of the intron 5 of the interferon -receptor gene that bears the 
AluVpA polymorphism as further described in Example 11. The scheme presents the primer oligo base extension 
(PROBE) using ddGTR ddCTP, or both for termination, respectively. The poiymorphism detection primer (IFN) is un- 
deriined, the tennination nucleotides are marked in bold letters. The theoretical mass values from the alleles found in 
28 unrelated individuals and a five member family are given in the table. Both second site mutations found in most 13 
units alieie, but not all, are indicated. 

[0101] FIGURE 53 shows the MALDI-TOF-MS spectra recorded directly fomn precipitated extended cyclePROBE 
reaction products. Family study using AluVpA polymorphism in intron 5 of the interferon-a receptor gene (Example 11 ). 
[0102] FIGURE 54 shows the mass spectra from PROBE products using ddC as tennination nucleotide in the reaction 
mix. The allele with the molecular mass of approximately 11 650 da from the DNA of the mother and child 2 is a hint to 
a second site mutation within one of the repeat units, 

[0103] FIGURE 55 shows a schematic presentation of the PROBE method for detection of different alleles in the 
polyT tract at the 3'-end of intron B of the CFTR gene with pppCdd as terminator (Example 11 ), 
[0104] FIGURE 56 shows the MALDI-TOF-MS spectra recorded directly from the precipitated extended PROBE 
reaction products. Detection of all three common alleles of the polyT tract at the 3' end of intron 8 of the CFTR gene. 
(a)T5/T9 heterozygous, (b) T7/T9 heterozygous {Example 11). 

[0105] FIGURE 57 shows a mass spectrum of the digestion of a 252-mer ApoE gene amplified product (G3/G3 
genotype) as described in Example 12 using a) Cfot alone and b) Cfol plus Rsal. Asterisks: depurination peaks. 
[0106] FIGURE 58 shows a mass spectrum of the ApoE gene amplified product (B3/G3 genotype) digested by Cfol 
and purified by a) single and b) double ethanol/glycogen and c) double isopropyl alcohol/glyccgen precipitations. 
[0107] FIGURE 59 shows a mass spectrum of the Cfol/Rsal digest products from a) G2/G3, b) G3/e3, c) G3/G4, 
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and d) e4/G4 genotypes. Dashed lines are drawn through diagnostic fragments. 

[0108] FIGURE 60 shows a scheme for rapid identification of unknown ApoE genotypes following simultaneous 
digestion of a 252-mer apo E gene amplified product by the restriction enzymes Cfoi and Rsal. 
[0109] FiGURE61 shows themultlplex(codDns 112 and 158) mass spectrum PROBE resuitsfora) G2/G3, b) G3/G3, 
c) E3/G4, and d) G4/G4 genotypes. E: extension products; P: unextended primer. Top: codon 112 and 158 regions, 
with poiymoiphic sites bold and primer sequences underlined, 

[0110] FIGURE 62 shows a mass spectrum of aTRAP assay to detect teiome rase activity (Example 13), The spec- 
trum shows two of the primer signals of the amplified product TS primer at 5,497.3 Da (calc. 5523 Da) and the bioti- 
nylated bioCX primer at 7,537.6 Da (calc. 7,537 Da) and the first telom erase-specif ic assay product containing three 
teJomeric repeats at 1 2,775.8 Da (calc. 1 2,452 Da) its mass is largerby one dA nucleotide (1 2,765 Da) due to extendase 
activity of Taq DNA polymerase. 

[Oil 1 ] FIGURE 63 depicts the higher mass range of FIGURE 62, i.e. the peak at 12,775.6 Da represents the products 
with these telomeric repeats. The peaks at 20,322.1 Da is the result of a telomerase activity to form seven tefomeric 
repeats (calc. 20,395 Da including the extension by one dA nucleotide). The peaks marked 1,2,3 and 4 contain a four 
telomeric repeats at 14,674 Da as well as secondary ion product. 

[0112] FIGURE 64 displays a MALDI-TOF spectrum of the RT-amplified product of the human tyrosine hydroxylase 
mRNA indicating the presence of neuroblastoma cells (Example 14). The signal at 18,763.8 Da represents the non- 
biotinylated single-stranded 61 mer of the nested amplified product (calc. 18,758.2 Da). 

[0113] FIGURE 65 (a) shows a schematic representation of a PROBE reaction for the RET proto-oncogene with a 
mixture of dATP, dCTP, dGTP, and ddTTP (Example 15). B represents biotin, through which the sense template strand 
is bound through slreptavidin to a solid support. Figure 65(b) shows the expected PROBE products for ddT and ddA 
reactions for wildtype, C^T, and C-^A antisense strands. 

[0114] FIGURE 66 shows the PROBE product mass spectra for (a) negative control, (b) Patient 1 being heterozygote 
(Wt/C->T) and (c) Patient 2 being heterozygote (Wt/C->A), reporting average values. 

[0115] FIGURE 67 shows the MALDI-FTMS spectra for synthetic analogs representing rtbo-cleaved RET prolo- 
oncogene amplified products from (a) wildtype, (b) G->A, and (c) G->T homozygotes, and (d) wildtype/G^A, (e) 
wildtype/G^T, and (f) G-5^A/G->T heterozygotes, reporting masses of most abundant isotope peaks. 
[01 16] FIGU RE 68 is a schematic representation of nucleic acid immobilization via covalent bifunctlona! trityl linkers, 
[0117] FIGURE 69 is a schematic representation of nucleic acid immobilization vfa hydrophobic trityl linkers. 
[01 18] FIGURE 70 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing 
bound oligo (S'-iminobiotin - TGCACCTGACTC, SEQ ID NO. 56). An internal standard (CTGTGGTCGTGC, SEQ ID 
NO. 57) was included in the matrix, 

[0119] FIGURE 71 shows a MAl,DI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing 
bound oligo (5'-iminobiotin ■ TGCACCTGACTC, SEQ ID NO. 56), An internal standard (CTGTGGTCGTGC, SEQ ID 
NO. 57) was included in the matrix. 

[0120] FIGURE 72 schematically depicts the steps involved with the Loop-primer oligo base extension (Loop-probe) 
reaction. 

[0121] FIGURE 73A shows a MALDI-TOF mass spectrum of a supernatant after Cfoi digest of a stem loop. Figure 
73B-D show MALDI-TOF mass spectrum of different genotypes: HbA the wildtype genotype (74B), HbC, a mutation 
of codon 6 of the p-globin gene which causes sickle cell disease (74C), and HbS, a different mutation of codon 6 of 
the p-globin gene which causes sickle cell disease (740). 

[0122] FIGURE 74 shows the nucleic acid sequence of the amplified region of CKR-5, The underlined sequence 
corresponds to the region homologous to the amplification primers. The dotted region corresponds to the 32 bp deletion. 
[0123] FIGURE 75 shows the sense primer ckrT7f. Being designed to facilitate binding of T7-RNA polymerase and 
amplification of the CKR-5 region to be analyzed, it starts with a randomly chosen sequence of 24 bases, the T7 
promoter sequence of 1 8 bases and the sequence homologous to CKR'5 of 1 9 bases, 

[0124] FIGURE 76 is a MALDI-TOF mass spectrum of the CKR-5 amplification product, which was generated as 
described in the following Example 21 . 

[0125] FIGURE 77 is a positive ion UV-MALDI mass spectra of a synthetic RNA 25-mer (5'-UCCGGUCUGAU- 
GAGUCCGUGAGGAC-3' SEQ ID NO. 62) digested with selected RNAses. For each enzyme 0,6^il atlquots of teh 4.5^tl 
assay containing a total of ca. 20 pmol of the RNA were fixed with 1 .5^1 matrix (3-HPA) for analysis. Fragments with 
retained 5'-terminus are marked by different arrows, specific for the different RNAses, (Hahner et aL, Proceedings of 
the 44th ASMS Conference on Mass Spectrometry and Allied Topics, p. 983 (1996)). 

[0126] FIGURE 73 Is an investigation of the specificity of the RNAses CL3 and Cusativin by positive Ion UV-MALDI 
mass spectra of a synthetic RNA 20-mer. Expected and/or observed cleavage sites are indicated by arrows. A, B, C 
indicate correct cleavage sites and corresponding singly cieaved fragments. Missing cleavages are designated by a 
question mark (?), unspecific cleavages by an X. 

[0127] FIGURE 79 shows the separation of a mixture of DNA molecules (12-mer, 5'-biot. 1 Q-mer, 22-mer and 5'-biot. 
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27-mer) with slreptavi din-coated magnetic beads, a) positive ion UV-M ALDI mass spectrum of 0.6^i of a mixture con- 
taining ca. 2-4 pmoi of eacli species mixed with 1.5^1 matrix (3-HPA). b) same as a) but incubatron of the mixture with 
magnetic beads and subsequent release of the captured fragments. 

[0128] FiGURE 80 Elution of immobilizedS* biotinylated 49 nt in wfro transcript from the streptavidin-coated magnetic 
beads. Positive UV-MALDi mass spectrum of the transcript prior to incubation with the magnetic beads (a). Spectra of 
the immobiiized RNA transcript after elution with 95% fomnamide alone (b) and with various additives such as 10mM 
EDTA (c), 10mM CDTA (d) and 25% ammonium hydroxide (e); EDTA and CDTA were adjusted with 25% ammonium 
hydroxide to a pH of 8. 

[0129] FiGURE 81 Positive UV-IVIALDI mass spectra of the 5' biotinylated 49 nt in vitro transcript after RNAse Ug 
digest for 15 minutes, a) Spectrum of the 25 ul assay containing ca. 100 pmoi of the target RNA before separation; b) 
spectrum after isolation of the 5'-biotinyiated fragments with magnetic beads. Captured fragments were released by a 
solution of 95% formamide containing 1 0 mM CDTA. 1 ui aliquots of the samples were mixed with 1 .5 ui matrix (3-HPA) 
in both cases. 

[0130] FIGURE 82 schematicaify depicts detection of putative mutations fn the human p-globin gene at codon 5 and 
6 and at codon 30, and the iVS-1 donor site, respectively, done in paraitei. FIGURE 82A shows amplification of genomic 
DNA using the primers ^2 and p11 . The location of the primers and identffication tags as well as an indication of the 
wild type and mutant sequences are shown. FiGURE 82B shows analysis of both sites in a simple Primer Reaction 
Oligo Base Extension (PROBE) using primers p-TAGI (which binds upstream of codon 5 and 6) and |3-TAG2 (which 
binds upstream of codon 30 and the IVS-1 donor site). Reaction products are captured using streptavidin -coated par- 
amagnetic particie bound biotinylated capture primers {cap-tag-1 and cap-tag-2, respectively), that have 6 bases at 
the 5' end that are complementary to the 5' end of P-TAG1 and |3-TAG2, respectively and a portion which binds to a 
universal primer. 

[0131] FiGURE 83 shows a mass spectrum of the PROBE products of a DNA sample from one Individual analyzed 
as described schematically in FiGURE 82. 

[0132] FIGURE 84 shows a mass spectrum of the sequence bound to cap-tag-2. 

[0133] FIGURE 85 shows a mass spectrum obtained by using the p-TAGI and p-TAG 2 primers in one sequencing 

reaction using ddATP for termination and then sorting according to the method depicted In FIGURE 82. 

[0134] FiGURE 86 shows a mass spectrum obtained by using the p-TAGI and P-TAG2 primers in one sequencing 

reaction using ddCTP for termination and then sorting according to the method depicted in FiGURE 82. 

[0135] FiGU RE S7A shows the wlldtype sequence of a fragment of the chemokine receptor CKR-6 gene with primers 

(bold) used for amplification. The 32 base pair (bp) deletion in the CKR-5 allele is underlined; and the stop nucleotides 

are in italic, fn FIGURE 876, the wildtype strands are depicted with and without an added Adenosine, their length and 

molecular masses are indicated. FiGURE 87C indicates the same for the 32 bp deletion. FIGURE S7D shows the 

PROBE products for the wildtype gene and FIGURE 87E shows the mutated allele. 

[0136] FiGURE 88 shows the amplification products of different unrelated individuals as analyzed by native polyacr- 
yiamide gel electrophoreses (15%) and silver stain. The band corresponding to a wildtype CKR-6 runs at 75 bp and 
the band from the gene with the deletion at 43 bp. Bands bigger than 75 bp are due to unspeclfic amplification. 
[0137] FIGURE 89A shows a spectrograph of DNA derived from a heterozygous individual: the peak with a mass of 
2331 9 Da corresponds to the wildtype CKR-5 and the peal<s with masses of 1 31 37 Da and 13451 Da to the deletion 
atiele with and without an extra Adenosine, respectively FiGURE 89B shows a spectrograph of DNA obtained from 
the same individual as in FiGURE 89A, but the DNA was treated with T4 DNA polymerase to remove the added Ade- 
nosine. FIGURES 89C and S9D are spectrographs derived from homozygous individuals and in FIGURE 890, the 
Adenosine has been removed. All peaks with masses lower than 13000 Da are due to multiple charged molecules. 
[0138] FIGURE 90A shows the mass spectrum of the results of a PROBE reaction performed on DNA obtained from 
a heterozygous individual. FiGURE 90B shows a mass spectrum of the results of a PROBE reaction on a homozygous 
individual. The peaks with masses of 6604 Da and 6607 Da, respectively correspond to the wlidtype allele, and the 
peak with a mass of 6275 Da to the deletion ai lele. The primer is detected with a mass of 5673 and 5676 Da, respectively 
[0139] FIGURE 91 shows a MALDi-TOF MS spectra of a thermocycling primer Ollgo Base Extension (tc-PROBE) 
reaction as described in Example 24 using three different templates and 5 different PROBE primers simultaneously in 
one reaction. 

[0140] FIGURE 92 schematically depicts a single tube process for amplifying and sequencing exons 5-8 of the p53 
gene as described in Example 25. The mass spectrum is the A reaction of Figure 93, 

[0141] FiGURE 93 shows a superposition plot of four separate reactions for sequencing a portion of exon 7 of the 
p53 gene as described in Example 25. 

[0142] FiGURE 94 shows the mass spectrum obtained from the A reaction for sequencing a portion of exon 7 of the 
p53 gene as described in Example 25. 

[0143] FiGURE 95 shows the mass spectrum of a p53 sequencing ladder for which 5nL of each reaction were trans- 
ferred to wells of a chip and measured by MALDi-TOF. 
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[0144] FIGURE 96A shows a MALDI-TOF mass spectra of a synthetic 50-mer (1 5.34 kDa) mixed with 27-mernc (non- 
complementary, 8.30 l<Da), 

[0145] FIGURE 96B shows a MALDI-TOF mass spectra of a synthetic 50-mer {15,34k Da) mixed with a 27-merj. 
{complementary, 8.34 kDa). The final concentration of each oligonucleotide was 10|a.M. The signal at 23.68 kDa in 
Figure 96B corresponds to WC-specif ic dsDNA. 

[0146] FIGURE 97A shows a MALDI-TOF mass spectrum of Cfol/Rsal digest products of a region of exon 4 of the 
apolipoprotein E gene (G3 genotype), using sample preparation as in Figure 96. 

[0147] FIGURE 97B is the same as Figure 97A, except with samples prepared for MALDI-TOF analysis at 4*C, 
[0148] FIGURE 98 shows a MALDI-TOF mass spectrum of Cfol/Rsal simultaneously double digest products of a 
252 base pair region of exon 4 of the apolipoprotein E gene (e4 genotype), with samples prepared at 4*'C, 
[0149] FIGURE 99 shows the mass spectra obatined on a small population study of 15 patients with a 16 element 
array of diagnostic products transferred to a MALDI target using a pintool microdispenser. 

[0150] FIGURE 1 00 is a MALDI mass spectrum of an aliquot sampled after a T^ digest of a synthetic 20-mer RNA. 

DETAPLED DESCRiPTION OF THE INVENTION 

Definitions 

[0151] UnJess defined otherwise, all technical and scientific temns used herein have the same meaning as is com- 
monly understood by one of skill in the art to which this invention belongs. Where permitted the subject matter of each 
of the co-pending patent applications and the patent is herein incorporated in its entirety. 

[0152] As used herein, the term "biological sample" refers to any material obtained from any living source ( e.g. , 
human, animal, plant, bacteria, fungi, protist, virus). For purposes herein, the biological sample will typically contain a 
nucleic acid molecule. Examples of appropriate biological samples include, but are not limited to: solid materials ( e.g. , 
tissue, cell pellets, biopsies) and biological fluids (e.g. , urine, blood, saliva, amniotic fluid, mouth wash, cerebral spinal 
fluid and other body fluids). 

[0153] As used herein, the phrases "chain-elongating nucleotides" and "chatn-terminating nucleotides" are used in 
accordance with their art recognized meaning. For example, for DNA, chain-elongating nucleotides include 2*deoxyrl- 
bonucJeotides ( e.g. , dATP, dCTP, dGTP and dTTP) and chain-temiinating nucleotides include 2', 3'-dideoxyribonucle- 
otides ( e.g. , ddATP, ddCTP, ddGTP, ddTTP). For RNA, chain-elongating nucleotides include ribonucleotides (e g. , 
ATJP, CTP, GTP and UTP) and chain-temninating nucleotides include 3'-deoxyribonucleotides ( e.g. , 3'dA, 3'dC, 3'dG 
and 3*dU). A complete set of chain elongating nucleotides refers to dATP, dCTP, dGTP and dTTP. The tenn "nucleotide" 
is also well known in the art. 

[0154] As used herein, nucleotides include nucleoside mono-, dr-, and triphosphates. Nucleotides also include mod- 
ified nucleotides such as phosphorothioate nucleotides and deazapurine nucleotides. A complete set of chain-elon- 
gating nucleotides refers to four different nucleotides that can hybridize to each of the four different bases comprising 
the DNA tempJato. 

[0155] As used herein, the superscript O-i designates i -i- 1 mass differentiated nucleotides, primers or tags. In some 
instances, the superscript O can designate an unmodified species of a particular reactant, and the superscript i can 
designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids are 
to be concurrently detected, then i + 1 different mass-modified detector oligonucleotides (0°, D"", ... D') can be used 
to distinguish each species of mass modified detector oligonucleotides (D) from the others by mass spectrometry. 
[0156] As used herein, "multiplexing" refers to the simultaneously detection of more than one analyte, such as more 
than one (mutated) loci on a particular captured nucleic acrd fragment (on one spot of an array). 
[0157] As used herein, the term "nucleic acid" refers to single-stranded and/or double-stranded polynucleotides such 
as deoxyribonucleic acid (DNA), and ribonucleic acid (RNA) as well as analogs or derivatives of either RNA or DNA. 
Also included in the term "nucleic acid" are analogs of nucleic acids such as peptide nucleic acid (PNA), phospho- 
rothioate DNA, and other such analogs and derivatives. 

[0158] As used herein, the tenn "conjugated" refers stable attachment, preferably ionic or covalent attachment. 
Among preferred conjugation means are: streptavidin- or avidin- to biotin Interaction; hydrophobic interaction; magnetic 
Interaction (e.g. . using functionalized magnetic beads, such as DYNASEADS, which are streplavi din-coated magnetic 
beads sold by Dynal, Inc. Great Neck, NY and Oslo Nonway); polar interactions, such as "wetting" associations between 
two polar surfaces or between oligo/polyethylene glycol; formation of a covalent bond, such as an amide bond, disulfide 
bond, thioether bond, or via crosslinking agents; and via an acid-labile or pholocleavable linker. 
[0159] As used herein equivalent, when referring to two sequences of nucleic acids means that the two sequences 
in question encode the same sequence of amino acids or equivaient proteins. When "equivalent" is used In referring 
to two proteins or peptides, it means that the two proteins or peptides have substantfally the same amino acid sequence 
with only conservative amino acid substitutions that do not substantfafly alter the activity or function of the protein or 
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peptide. When "equivalent" refers to a property, the property does not need to be present to the same extent [e.g. , two 
peptides can exhibit different rates of the same type of enzymatic activity], but the activities are preferably substantially 
the same. "Complementary," when referring to two nucleotide sequences, means that the two sequences of nucleotides 
are capable of hybridizing, preferably with less than 25%, more preferably with less than 15%, even more preferably 
with less than 5%, most preferably with no mismatches between opposed nucleotides. Preferably the two molecules 
will hybridize under conditions of high stringency. 

[0160] As used herein: stringency of hybridization in detennining percentage mismatch are those conditions under- 
stood by those of skill in the art and typicaily are substantially equivalent to the following: 

1) high strfngency: 0.1 x SSPE, 0.1% SDS. 65*C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50*C 

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50'C 

It is understood that equivalent stringencies may be achieved using alternative buffers, salts and temperatures. 
[0161] As used herein, a primer when set forth in the claims refers to a primersuitabfe for mass spectrometric methods 
requiring immobilfzing, hybridizing, strand displacement, sequencing mass spectrometry refers to a nucleic acid must 
be of low enough mass, typicaily about 70 nucleotides or less than 70, and of sufficient size to be useful fn the mass 
spectrometric methods described herein that reiy on mass spectrometric detection. These methods include primers 
for detection and seequening of nucleic acids, which require a sufficient number nucleotides to from a stable duplex, 
typicaily about 6-30, preferably about 1 0-25. more preferably about 12-20. Thus, for purposes herein a primer will be 
a sequence of nucleotides comprising about 6-70, more preferably a 12-70, more preferably greater than about 14 to 
an upper limit of 70, depending upon sequence and application of the primer. The primers herein, for example for 
mutational analyses, are selected to be upstream of loci useful for diagnosis such that when performing using sequenc- 
ing up to or through the site of Interest, the resulting fragment is of amass that sufficient and not too large to be detected 
by mass spectrometry. For mass spectrometric methods, mass tags or modifier are preferably included at the 6'-end, 
and the primer is otherwise unlabeled. 

[0162J As used herein, "conditioning" of a nucleic acid refers to modification of the phosphodrester backbone of the 
nucleic acid molecule ( e.g. . cation exchange) for the purpose of eliminating peak broadening due to a heterogeneity 
in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating agent such as akyiiodide, 
iodoacetamide, 0-iodoethanol, or2,3-epoxy-1 -propanol, the monothio phosphodiester bonds of a nucleic acid molecule 
can be transfonned into a phosphotriester bond. Likewise, phosphodiester bonds may be transfonmed to uncharged 
derivatives employing trialkylsftyl chlorides. Further conditioning involves incorporating nucleotides that reduce sensi- 
tivity for depurination (fragmentation during MS) e^, a purine analog such as N7- or N9-deazapurine nucleotides, or 
RNA building blocks or using oligonucleotide trfesters or incorporating phosphorothioate functfons that are alkylated 
or employing oligonucleotide mimetics such as peptide nucleic acid (RNA). 

[0163] As used herein, substrate refers to an insoluble support onto which a sample is deposited according to the 
materials described herein. Examples of appropriate substrates include beads (e.g., silica gel, controlled pore glass, 
magnetic, agaroase gele and crosslinked dextroses (le. Sepharose and Sephadex, cellulose and other materials 
known by those of skill in the art to serve as solid support matrices. For examples substrates may be formed from any 
or combltlons of: sitica gel, glass, magnet, polystyrene/l % divinylbenzene resins, such as Wang resins, which are 
Fmoc-amlno acid-4-{hydroxymethyl)phenoxymethylcopoly(styrene-1 % divtnyibenzene (DVD)) resin, chlorotrityl 
(2-chlorotrityichloride copolystyrene-DVB resin) resin, IWerrifieid (chloromethylated copotystyrene-DVB) resin metal, 
plastic, cellulose, cross-linked dextrans, such as those sold under the tradename Sephadex (Pharmacia) and agarose 
gel, such as gels sold under the tradename Sepharose (Pharmacia), which is a hydrogen bonded polysaccharl de-type 
agarose gel, and other such resins and solid phase supports known to those of skill in the art. The support matrices 
may be in any shape or fomi, includfng, but not limited to: capillaries, flat supports such as glass fiber filters, glass 
surfaces, metai surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials including multiwell plates 
or membranes (e.g., of polyethylene, polypropylene, polyamide, polyvinylldenedifluoride), pins (e.g., arrays of pins 
suitable for combinatorial synthesis or analysis or beads in pits of fiat surfaces such as wafers (e.g.. silicon wafers) 
with or without piates, and beads. 

[0164] As used herein, a selectively cleavable linker Is a linker that is cleaved under selected conditions, such as a 
photocieavable linker, a chemically cleavable linker and an enzymaticalty cleavable linker (i.e., a restriction endonu- 
clease site or a ribonucleotfde/RNase digestion). The linker is interposed between the support and immobilized DNA. 

Isolation of nucleic acJds molecules 

[0165] Nucleic acid molecules can be isolated from a particular biological sample using any of a number of proce- 
dures, which are well-known In the art, the particular isolation procedure chosen being appropriate for the particular 
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biologicai sample. For example, freeze-thaw and alkafine fysis procedures can be useful for obtaining nucleic acid 
molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules 
from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (see, e^, Rotff et aL (1994) 
PGR: Clinical Diagnostics and Research, Springer). 

[0166] To obtain an appropriate quantity of a nucleic acid molecules on which to perfomn mass spectrometry, ampli- 
fication may be necessary. Examples of appropriate amplification procedures for use herein include: cloning (Sambrook 
et al., MolecularCloning; A Laboratory Manual . Cold Spring Harbor Laboratory Press ,1989), polymerase chain reaction 
(PGR) (C.R. Newton and A. Graham, PGR, BIOS Publishers, 1 994), ligase chain reaction (LOR) (see, e.g., Weidmann 
etal,(1994) PGR Methods Appl . Vol. 3, Pp. 57-64; F Barany (1991) Proc. Natl. Acad. Sci. U.S.A. 8B:189-93>. strand 
displacement amplification (SDA) (see, e^, Walker etal, (1994) Nudeic Acids Res. 22:2670-77^ and variations such 
as RT-PCR (see, e^, HIguchI et aL (1993) Brorrechnology 11:1026-1030), allele-specific amplification (ASA) and 
transcription based processes. 

Immobilization of nucleic acid molecules to solid supports 

[0167] To facilitate mass spectrometric analysis, a nucleic acid molecule contafntng a nucleic acid sequence to be 
detected can be immobilized to an insoluble (i.e., a solid) support. Examples of appropriate solid supports include 
beads (e.g. , silica gel, controlled pore glass, magnetic, Sephadex/Sepharose, ce(luiose), capillaries, flat supports such 
as glass fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materiafs 
includtng multrwell plates or membranes ( e.g. > of polyethylene, polypropylene, polyamide, poiyvinylidenedifluoride), 
pins (e^, arrays of pins suitable for combinatorial synthesis or analysis or beads in pits of flat surfaces such as wafers 
( e.g. , silicon wafers) with or without filter plates. 

[0168] Samples containing target nucieic acids can be transferred to solid supports by any of a variety of methods 
known to those of skill in the art. For example, nucleic acid samples can be transferred to individual wells of a substrate, 
e^, silicon chip, manually or using a pintool mFcrodispenser apparatus as described herein. Alternatively, a piezoe- 
lectric pipette apparatus can be used to transfer small nanoliter samples to a substrate pemiitting the performance of 
high throughput miniaturized diagnostics on a chip. 

[0169] Immobilization can be accomplished, for example, based on hybridization between a capture nucleic acid 
sequence, which has aJready been immobilfzed to the support and a complementary nucleic acid sequence, which Is 
also contained within the nucieic acid molecule containing the nucleic acid sequence to be detected (FIGURE 1 A). So 
that hybridization between the complementary nuclefc acid molecules is not hindered by thesupport, the capture nucleic 
acid can incJude an e^, spacer region of at least about five nucJeotides in length between the solid support and the 
capture nucleic acid sequence. The duplex formed will be cleaved under the Influence of the laser pulse and desorption 
can be initiated. The solid support-bound nucleic acid molecule can be presented through natural oligoribo- or oNgo- 
deoxyribonucJeotideas well as analogs (e.g. , thio-modified phosphodlester or phosphotriester backbone) or employing 
oligonucieotide mimetics such as PNA analogs (see, e^, Nielsen etai. Science 254 :1497 (1991)) which render the 
base sequence less susceptible to enzymatic degradation and -bound capture base sequence. 

Lin leers 

[0170] A target detection site can be directly linked to a solid support via a reversible or irreversible bond between 
an appropriate functionality (L') on the target nucieic acid molecule (T) and an appropriate functionality (L) on the 
capture molecule (FIGURE 1 B). A reversible linkage can be such that it is cleaved under the conditions of mass spec- 
trometry (i.e., a photocleavable bond such as a charge transfer complex or a labile bond being formed between reiatively 
stable organic radicals). 

[0171 ] Photocleavable linkers are linkers that are cleaved upon exposure to light (see, e^, Goldmacher et at. (1992) 
Bioconj. Chem. 3:104-107), thereby releasing the targeted agent upon exposure to light. Photocleavablelinkers that 
are cieaved upon exposure to light are known (see, e^, Hazum et al. (1981) in Pept., Proc. Eur. Pept. Symp., 16th . 
Brunfeidt, K (Ed), pp. 105-110, which describes the use of a nitrobenzyl group as a photocleavable protective group 
for cysteine; Yen etal. (1989) Makromol. Chem 190 :69-62, which describes watersoluble photocleavable copolymers, 
including hydroxypropyfmethacrylamide copolymer, glycine copolymer, fluorescein copolymer and methylrhodamlne 
copolymer; Goldmacher etal. (1 992) Bioconj. Chem. 3 :104-1 07, which describes a cross-linker and reagent that un- 
dergoes photolytic degradation upon exposure to near UV light (350 nm); and Senter etal. (1985) Photochem. Photobioi 
42:231-237, which describes nitrobenzyioxycarbonyl chloride cross linking reagents that produce photocleavable link- 
ages), thereby releasing the targeted agent upon exposure to light. In preferred embodiments, the nucleic acid is 
immobilized using the photocleavable linker moiety that is cleaved during mass spectrometry. Presently preferred 
photocleavable linkers are set forth in the EXAMPLES, 

[0172] Furthermore, the linkage can befonmed with U being a quaternary ammonium group, In which case, preferably, 
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the surface of the solid support carries negative charges which repef the negatively charged nucieic acid backbone 
and thus facilitate the desorption required for analysis by a mass spectrometer. Desorption can occur either by the 
heat created by the laser pulse and/or, depending on L/ by specific absorption of laser energy which is in resonance 
with the L' chromophore. 

[0173] Thus, the L-L' chemistry can be of a type of disulfide bond (chemically cieavable, for example, by mercap- 
toethanoi or dithioerythroi), a biotin/slreplavidin system, a heteroblfunctional derivative of a trityl ether group (see, e. 
a,, Kosteret alj. (1990) "A Versatile Acid-Labile Linker for Modification of Synthetic Biomolecuies," Tetrahedron Letters 
31_;7095) that can be cleaved under mildly acidic conditions as well as underconditions of mass spectrometry, a levulinyl 
group cleavable under almost neutral condftions with a hydraziniunn/acetate buffer, an arginine-arginine or lysine-lysine 
bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond cleavable by a pyrophosphatase, 
or a ribonucleotide bond in between the oligodeoxynucleotide sequence, which can be cleaved, for example, by a 
ribonuclease or alkali. 

[0174] The functionalities, L and L,' can also form a charge transfer compiex and thereby form the temporary L-L' 
linkage. Since in many cases the "charge-transfer band" can be determined by UV/vis spectrometry (see, e^, Organic 
Charge Transfer Complexes by R, Foster, Academic Press, 1 969), the laser energy can be tuned to the corresponding 
energy of the charge- transfer wavelength and, thus, a specific desorption off the solid support can be initiated. Those 
skilled in the art will recognize that several combinations can serve this purpose and that the donor functionality can 
be either on the solid support or coupled to the nucleic acid molecule to be detected or vice versa. 
[0175] In yet another approach, a reversible L-L' linkage can be generated by homolytically fomning relatively stable 
radicals. Under the influence of the laser pulse, desorption (as discussed above) as well as ionization will take place 
at the radical position. Those skilled In the art will recognize that other organic radicals can be selected and that, in 
relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding laser 
wavelength can be selected (see e.g. , Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). 
[0176] An anchoring function U can also be incorporated into a target capturing sequence (TCS) by using appropriate 
primers during an amplification procedure, such as PCR (FIGURE 4), LCR (FIGURE 5) or transcription ampiificatlon 
(FIGURE 6A). 

[0177] When performing exonuclease sequencing using MALDI-TOF MS, a single stranded DNA molecule immobi- 
lized via its 5-end to a solid support is unilaterally degraded with a 3' -process ive exonuclease and the molecular weight 
of the degraded nucleotide is determined sequentially. Reverse Sanger sequencing reveals the nucleotide sequence 
of the immobilized DNA. By adding a sefeclively cleavable linker, not only can the mass of the free nucleotides be 
determined but also, upon removal of the nucleotides by washing, the mass of the remaining fragment can be detected 
by MALDI-TOF upon cleaving the DNA from the solid support. Using selectively cleavable linkers, such as the photo- 
cleavable and chemical cleavable linkers provided herein, this cleavage can be selected to occur during the ionization 
and volatizing steps of MALDI-TOR The same rationale applies for a 5" immobilized strand of a double stranded DNA 
that is degraded while In a duplex. Likewise, this also applies when using a 5'-processive exonuclease and the DNA 
is immobilized through the 3'-end to the solid support, 

[0178] As noted, at least three version of immobilization are contemplated herein: 1) the target nucleic acid is am- 
plified or obtained (the target sequence or surrounding DNA sequence must be known to make primers to amplify or 
isolated); 2) the primer nucleic acid is immobilized to the solid support and the target nucleic acid is hybridized thereto 
(this is for detecting the presence of or sequencing a target sequence in a sample); or 3) a double stranded DNA 
(amplified or isolated) is immobilized through linkage to one predetermined strand, the DNA is denatured to eliminate 
the duplex and then a high concentration of a complementary primer or DNA with identity upstream from the target 
site is added and a strand displacement occurs and the primer is hybridized to the immobilized strand. 
[0179] In the embodiments where the primer nucieic acid is immobilized on the solid support and the target nucieic 
acid is hybridized thereto, the inclusion of the cleavable linker allows the primer DNA to be immobilized at the 5'-end 
so that free S'-OH is available for nucleic acid synthesis (extension) and the sequence of the "hybridized" target DNA 
can be deterniined because the hybridized template can be removed by denaturatlon and the extended DNA products 
cleaved from the solid support for MALDI-TOF MS. Similarly for 3), the immobilized DNA strand can be elongated when 
hybridized to the template and cleaved from the support. Thus, Sanger sequencing and £rimer oligo base extension 
(PROBE), discussed below, extension reactions can be perfonned using an immobilized primer of a known, upstreamn 
DNA sequence complementary to an invariable region of a target sequence. The nucleic acid from the person is ob- 
tained and the DNA sequence of a variable region (deletion, insertion, missense mutation that cause genetic predis- 
position or diseases, or the presence of viral/bacterial or fungal DNA) not only is detected, but the actual sequence 
and position of the mutation is also detemnined. 

[0180] In other cases, the target DNA must be immobilized and the primer annealed. This requires amplifying a larger 
DNA based on known sequence and then sequencing the immobilized fragments (i.e., the extended fragments are 
hybridized but not immobilized to the support as described above). In these cases, it is not desirable to include a linker 
because the MALDI-TOF spectrum is of the hybridized DNA; it is not necessary to cleave the immobilized template. 
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[0181] Any linker known to those of skill in the artfor innmobilizing nucleic acids to solid supports may be used herein 
to link the nucleic acid to a solid support. The preferred IEnl<ers herein are the selectively cleavable linkers, particularly 
those exemplified herein. Other linkers include, acid cleavable linkers, such as bismaleimideothoxy propane, acid- 
labile trttyl linkers. 

5 [0162] Acid cleavable linkers, photocleavable and heat sensitive linkers may also be used, particutarly where it may 
be necessary to cleave the targeted agent to permit it to be more readily accessible to reaction. Acid cleavable linkers 
include, but are not limited to, bismaleimideothoxy propane; and adipic acid dihydrazide linkers (see, e^, Fattom et 
§L (1992) Infection & Immun. 60:564-589) and acid labile transferrin conjugates that contain a sufficient portion of 
transferrin to permit entry into the intracellular transferrin cycling pathway (see, e.g. , Welhoner et ai. (1991) J. Biol. 

fo Chem. 266:4309-4314). 

PhotocJeavable Linkers 

[0183] Photocleavable linkers are provided. In particular, photocleavable linkers as their phosphoramidite derivatives 
^5 are provided for use tn solid phase synthesis of oligonucleotides. The linkers contain o-nitrobenzyi moieties and phos- 
phate linkages which allow for complete photolytic cleavage of the conjugates within minutes upon UV irradiation. The 
UV wavelengths used are selected so that the irradiation will not damage the oligonucleotides and are preferrably 
about 350-380 nm, more preferably 365 nm. The photocleavable linkers provided herein possess comparable coupling 
efficiency as compared to commonfy used phosphoramidite monomers (see, Sinha et aL (1 983) Tetrahedron Lett. 24: 
£0 5843-5846; Sinha et al. (1984) Nucleic Acids Res. 12:4539-4557: Beaucage et al. (1 993) Tetrahedron 49:6123-6194: 
and Matteucci et aL (1 981) J. Am. Chem. Soc. 103 :3185-3191). 
[0184] In one embodiment, the photocleavable linkers have formula I: 


25 


30 



(I) 


where R20 is a)-(4,4'-dlmethoxytrityloxy)alkyl or co-hydroxyalkyi; R^^ Is selected from hydrogen, alkyf, aryl, aikoxycar- 
bonyl, aryloxycarbonyi and carboxy; R22 is hydrogen or {dialkyiamino)((o-cyanoalkoxy)P-; t is 0-3; and R^o [s alkyi, 
alkoxy, aryl or aryloxy. 

[0185] In a preferred embodiment, the photocleavable linkers have formula II: 


45 


50 



where R^o is w-(4,4'-dimethoxytrityloxy)alkyl, (o-hydroxyaikyl oralkyi; R^^ is selected from hydrogen, alkyi, aryl, alkox- 
ycarbonyl. aryloxycarbonyi and carboxy; is hydrogen or {dialkylam(no)((i)-cyanoalkoxy)P-; and X20 js hydrogen, 
alkyi orORSO, 
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[0186] In particularly preferred embodiments, R^^ is 3-(4,4'-dimetlioxytrityloxy)propyl, 3-hydroxypropyJ or methyl; 

is selected from l^ydrogen, methyl and carboxy; R22 is hydrogen or (diisopropy!amino){2-cyanoethoxy)P-; and X^o 
is hydrogen, metliyl or OR^o. fn a more preferred embodiment, R^o Is 3-(4,4'-dimethoxytrityloxy)propyl; R21 is methyl; 

is (diisopropylamino){2-cyanoethoxy)P-; and is hydrogen. In another more preferred embodiment, r£<> is me- 
thyl; R2-I is methyl; R^a is (diisopropylamfna)(2-cyanoethoxy)P-; and X20 is 3-(4,4'-dimethoxytrityloxy)propoxy. 
[0187] In another embodiment, the photocleavable linkers have formula III: 



where r23 is hydrogen or (dialkylamino)({o-cyanoalkoxy)P-; and R^* is selected from w-hydroxyaikoxy, a)-(4,4'-dimeth- 
oxytrityloxy)alkoxy, to- hydroxy alkyl and o)-(4,4'-dimethoxytrity(oxy)alkyl, and is unsubstituted or substituted on the alkyi 
or aikoxy chairs with one or more alkyi groups; r and s are each independently 0-4; and R^ is alkyi, alkoxy, aryl or 
aryloxy. In certain embodiments, R^* is co-hydroxyalkyi or a>-{4,4'-dimethoxytrityloxy)alkyl, and Is substituted on the 
alkyf chain with a methyl group. 

[01 88] In preferred embodiments, R^a is hydrogen or (dlisopropylamino)(2-cyanoethoxy)Ps and R24 is selected from 
3-hydroxypropoxy, 3-(4,4'-dimethoxytrityloxy)propoxy, 4-hydroxybutyl, 3 -hydroxy-1 -propyl, 1-hydroxy-2-propyl, S-hy- 
droxy-2-methy 1-1 -propyl, 2-hydroxy ethyl, hydroxymethyi, 4-(4,4'-dimethoxytrityloxy)butyl, 3-C4,4'-dimethoxytrjtyloxy)- 
1 -propyl, 2-(4,4'-dimethoxytrityloxy)ethyl, 1 -(4,4'-dimethoxytrityloxy)-2-propyi, 3-(4,4'-dimethoxytriyloxy)-2-methy(- 
1 -propyl and 4, 4'-dimethyoxytrltyloxy methyl. 

[0189] In more preferred embodiments, R23 is (diisopropylamino)(2-cyanoethoxy)P-; rand s are O; and R^* is se- 
lected from 3-(4,4'-dimethoxytrityloxy)propoxy, 4-(4,4'-dimethoxytrttyloxy)butyl. 3-(4,4'-dimethoxytrityloxy)propyl, 2- 
(4,4'-dimethoxytrityioxy)ethyl, 1-(4,4'-dimothoxytrityloxy)-2-propyl, 3-(4,4'-dimethoxytriyloxy)^2-melhy 1-1 -propyl and 
4,4'-dimethyoxytrityroxymethyl. R^* is most preferably 3-(4,4'-dimethoxytrityloxy)propoxy. 

Preparation of the photocleavable linkers 

A. Preparation of photocleavable linkers of formulae I or II 

[0190] Photocleavable linkers of formulae I or II may be prepared by the methods described below, by minor modi- 
fication of the methods by choosing the appropriate starting materials or by any other methods known to those of skill 
in the art. Detailed procedures for the synthesis of photocleavable linkers of formula II are provided in the Examples. 
[0191] In the photocleavable linkers of formula II where X^o is hydrogen, the linkers may be prepared in the following 
manner. Alkyiation of 5-hydroxy-2-nitrobenzaldehyde with an oo-hydroxyalkyl halide, e^, 3-hydroxypropyl bromide, 
followed by protection of the resulting alcohol as, e^, a silyl ether, provides a 5-(a)-sllyloxyalkoxy)-2-nitrobenzaldehyde. 
Addition of an organometallic to the aldehyde affords a benzyiic alcohol. Organometallics which may be used include 
trialkylalumtnums (for linkers where R21 is alkyi), such as trimethylaluminum, borohydrides (for linkers where R21 is 
hydrogen), such as sodium borohydride, or metal cyanides (for linkers where R^^ is carboxy or alkoxycarbonyl), such 
as potassium cyanide. In the case of the metal cyanides, the product of the reaction, a cyanohydrin, would then be 
hydrolyzed under either acidic or basic conditions in the presence of either water or an alcohol to afford the compounds 
of interest. 

[0192] The silyl group of the side chain of the resulting benzyiic alcohols may then be exchanged for a 4,4'-dlmelh- 
oxytriyl group by desilylation with, e.g. , tetrabutylammonium fluoride, to give the corresponding alcohol, followed by 
reaction with 4,4'-dlmethoxytrityl chloride. Reaction with, e^, 2-cyanoethyl diisopropylchlorophosphoramldite affords 
the linkers where R22 is {diatkylamino)(a)-cyanoalkoxy)P-. 

[0193] A specific example of a synthesis of a photocleavable linker of formula II is shown in the following scheme, 
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which also demonstrates use of the linker in oligonucleotide synthesis. This scheme is intended to be illustrative only 
and In no way limits the scope of the invention. Experimental details of these synthetic transformations are provided 
in the Examples. 


HO 


HO 



NO 


cat. Kt 


CHO 
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[01d4] Synthesfs of the linkers of formula II where X^o |s OR^o^ 3,4-dihydroxyacetophenone is protected selectiveJy 
at the 4-hyclroxyl by reaction with, e^, potassium carbonate and a silyl chloride, Benzoate esteres, propiophenones, 
butyrophenones, etc. may be used in place of the acetophenone. The resulting 4-silyioxy-3-hydroxyacetophenone is 
then aticylated at the with an aikyi halide (for linkers where R^o is alkyi) at the 3-hydroxy) and desilylated with, e.g. , 
tetrabuyiannnnonium fluoride to afford a 3-alkoxy'4-hydroxyacetophenone. This compound is then alkylated atthe4'hy- 
droxyi by reaction with an a>-hydroxyalkyl halide, e^, 3-hydroxypropyl bromide, to gfve a 4-{o>-hydroxyalkoxy)-3-alkoxy- 
acetophenone. The side chain alcohol is then protected as an ester, e^. an acetate. This compound Is then nitrated 
at the 6-positlon with, e^, concentrated nitric acid to provide the corresponding 2-nitroacetop hen ones. Saponification 
of the side chain ester with, e.g. , potassium carbonate, and reduction of the ketone with, e.g. , sodium borohydride. In 
either order gives a 2-nitro-4-(ro-hydro)cyalkoxy)-5-alkoxyben2yfic alcohol. 

[0195] Selective protection of the side chain alcohol as the corresponding 4,4'-dimethoxytrityl ether is then accom- 
plished by reaction with 4,4'-dimethoxytrityl chloride. Further reaction with, e.g^ , 2-cyanoethyl diisopropyichlorophos- 
phoramidite affords the linkers where R22 is {dialkylamino)(a)-cyanoalkoxy)P-. 

[0196] A specific exannple of the synthesis of a photocleavable linker of formula II is shown the following scheme. 
This scheme is intended to be illuatralive only and in no way limit the scope of the invention. Detailed experimental 
procedures for the transformations shown are found in the Examples. 
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B. Preparation of photocleavable linkers of formula iii 

[0197] Photocleavable linkers of formula III may be prepared by the methods described below, by minor modification 
of the methods by choosing appropriate starting materials, or by other methods known to those of skill in the art. 
[0198] In general, photocleavable linkers of fonmula III are prepared from co-hydroxyalkyl- or alkoxyary I compounds, 
in particular a>- hydroxy -alky I or alkoxy-benzenes. These compounds are commercially available, or may be prepared 
from an o>-hydroxyalkyf halide (e.g. , 3-hydroxy propyl bromide) and either phenylfithium (for the co-hydroxyalkylben- 
zenes) or phenol (for the w-hydroxyalkoxybenzenes), Acylatton of the oo-hydroxyl group (e.c[^, as an acetate ester) 
followed by Friedel-Crafts acylation of the aromatic ring with 2-ritrobenzoyl chloride provides a 4-(<o-acetoxy-alkyl or 
alkoxy)-2-nitrobenzQphenone. Reduction of the ketone with, e^, sodium borohydride, and saponification of the side 
chain ester are performed in either order to afford a 2-nltrophenyl-4-(hydroxy'alkyl or alkoxy)phenylmethanol. Protec- 
tion of the terminal hydroxy! group as the corresponding 4,4'-dimethoxytrityl ether is achieved by reaction with 4,4'- 
dimethoxytrityi chloride. The benzylic hydroxyl group fs then reacted with, e^, 2-cyanoethyl diisopropylchlorophos- 
phoramidite to afford linkers of formula II where R23 is (dialkyiaminc)(u)-cyanoalkoxy)P-. 

[0199] Other photocleavable linkers of fomnula III may be prepared by substituting 2-phenyi-1 -proparoi or 2-phenyl- 
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methyl- 1 -propanot for the (o-hydroxy-alkyl or alkoxy-benzenes in the above synthesis. These compounds are commer- 
ctaliy availabie, but may aiso beprepared by reaction of, e^, phenylmagneslum bromide orbenzylmagnesium bromide, 
with the requisite oxirane (i.e. . propylene oxide) in the presence of cataJytic cuprous ion. 

Chemically cleavable linkers 

[0200] A variety of chemically cleavable iinkers may be used to introduce a cleavabJe bond between the immobilized 
rrucleic acid and the soiid support, Acid-iabiie linkers are presently preferred chemicalty cleavabie linkers for mass 
spectrometry, especially MALDi-TOF MS, because the acid labile bond is cleaved during conditioning of the nucleic 
acid upon addition of the 3-HPA matrix solution. The acid labile bond can be introduced as a separate linker group, e. 
3. , the acid labile trityl groups (see Figure 68; Example 1 6) or may be incorporated in a synthetic nucleic acid linker by 
introducing one or more silyl irrternucleoside bridges using diisopropylsilyl, thereby forming diisopropylsilyl-linl<ed oli- 
gonucleotide analogs. The diisopropylsilyi bridge replaces the phoshodiester bond in the DNA backbone and under 
mildly acidic conditions, such as 1 .5% trifluoroacetic acid (TFA) or 3-HPA/1% TFA MALDI-TOF matrix solution, results 
in the introduction of one or more intra-strand breaks in the DNA molecule. Methods for the preparation of diisopro- 
pylsilyl-iinked oligonucleotide precursors and analogs are known to those of skill in the art (see e^, Saha et aL (1 993) 
J. . Org. Chem. 58 :7827-7831 ). These oligonucleotide analogs maybe readily prepared using solid state oligonucleotide 
synthesis methods using diisopropylsilyl derivatized deoxyribonucieosides. 

Nucleic acid conditioning 

[0201] Prior to mass spectrometric analysis, it may be usefut to "condition" nucleic acid molecules, for example to 
decrease the laser energy required for volatilization and/or to minimize fragmentation. Conditioning is preferably per- 
formed while a target detection site is immobilized. An example of conditioning is modification of the phosphodiester 
backbone of the nucleic acid molecule ( e.g. , cation exchange), which can be useful for eliminating peak broadening 
due to a heterogeneity in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating 
agent such as akyiiodide, iodoacetamide, p-iodoethanol, or 2,3-epoxy-1 -propanoic the monothio phosphodiester bonds 
of a nucleic acid molecule can be transformed into a phosphotri ester bond. Likewise, phosphodiester bonds may be 
transformed to uncharged derivatives employing trialkylsiiyi chlorides. Further conditioning involves incorporating nu- 
cleotides that reduce sensitivity for depurination (fragmentation during MS) e.g. , a purine analog such as N7- or N9-dea- 
zapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate func- 
tions which are alkylated or employing oligonijcleotide mimetics such as PNA. 

Multiplex reactions 

[0202] For certain applications, it may be useful to simultaneously detect more than one (mutated) loci on a particular 
captured nucleic acid fragment (on one spot of an array) or it may be useful to perform parallel processing by using 
oligonucleotide or oligonucleotide mimetic arrays on various solid supports. '^Multiplexing" can be achieved by several 
different methodologies. For example, several mutations can be simultaneously detected on one target sequence by 
employing corresponding detector (probe) molecules (e.g. , oligonucleotides or oligonucleotide mimetics). The molec- 
ular weight differences between the detector oligonucleotides D1, D2 and D3 must be large enough so that simulta- 
neous detection (multiplexing) is possible. This can be achieved either by the sequence itself (composition or length) 
or by the introduction of mass-modifying functionalities M1 -M3 into the detector oligonucleotide (seel FIGURE 2). 

Ma68 modification of nucleic acids 

[0203] Mass modifying moieties can be attached, for instance, to either the 5'-end of the oligonucleotide (M""), to the 
nucleobase (or bases) (M^, M^), to the phosphate backbone (M^), and to the2'-position of the nucleoside (nucleosides) 
(M'^, M^) and/or to thetenninal 3*-positlon (M®). Examples of mass modifying moieties include, for example, a halogen, 
an azido, or of the type, XR, wherein X Is a linking group and R is a mass-modifying functionality. The mass-modifying 
functionality can thus be used to introduce defined mass increments into the oligonucieotide molecule. 
[0204] The mass-modifying functionality can be located at different positions within the nucleotide moiety (see, e.g. . 
U.S. Patent No. 5,547,835 and International PCT application No. WO 94/21822). For example, the mass-modifying 
moiety, M, can be attached either to the nucleobase, M^ (in case of the -deazanucleosides also to C-7, M^), to the 
triphosphate group at the alpha phosphate, M^, or to the 2'-position of the sugar ring of the nucleoside triphosphate, 
M^ and M^, Modifications introduced at the phosphodiester bond (M4), such as with alpha-thio nucleoside triphos- 
phates, have the advantage that these modifications do not interfere with accurate Watson-Crick base-pairing and 
additionally allow for the one-step post-synthetic site-specific modiffcation of the complete nucleic acid molecule e.g. . 
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via alkylation reactions (see, e.g. , Nakamaye et aL (1988) Nucl. Acids Res. 16 :9947-59). Particuiariy preferred mass- 
modifying functionalities are boron-modified nucieic acids since they are better incorporated into nucleic acids by 
polymerases (see, e,a.. Porter etal. (1995) Biochemistry 34:11963-11969; Hasan et al. (1996) Nucleic Acids Res. 24: 
2150-2157; Li et al. (1995) Ntjcl. Acids Res. 23:4495-4501). 

[0205] Furthermore, the mass-modifying functionality can be added so as to affect chain termination, such as by 
attaching it to the 3'-position of the sugar ring in the nucleoside triphosphate, M^. For those skilled in the art, It is clear 
that many combinations can be used in the methods provided herein. In the same way, those skilled in the art will 
recognize that chain-efongating nucleoside triphosphates can also be mass-modified in asimilarfashion with numerous 
variations and combinations in functionality and attachment positions. 

[0206] Without being bound to any particular theory, the mass-modification, M, can be introduced for X in XR as well 
as using oligo-Zpoiyethylene glycol derivatives for R. The mass-modrfying increment in this case is 44, i.e. five different 
mass-modified species can be generated by just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m = 
1), 133 (m = 2), 177 (m ^ 3) and 221 (m = 4) to the nucleic acid molecule (e.g. , detector oligonucleotide (D) or the 
nucleoside triphosphates (FIGURE 6(C)), respectively). The oligo/polyelhylene glycols can also be monoaikyiated by 
a lower aikyi such as methyl, ethyl, propyl, isopropyi, t-butyi and the like. A selection of linking functionalities, X, are 
also illustrated. Other chemistries can be used in the mass-modified compounds (see, e^, those described in Oligo- 
nucleotides and Analogues, A Practical Approach , F. Eckstein, editor, IRL Press, Oxford, 1991). 
[0207] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols, 
can be selected and attached via appropriate linking chemistries, X. A simple mass-modification can be achieved by 
substituting H for halogens like F, CI, Br and/or 1, or pseudohalogens such as CN, SCN, NCS, or by using different 
alkyi, aryl or aralkyi moieties such as methyl, ethyl, propyl, isopropyi, t-butyl, hexyi, phenyl, substituted phenyl, benzyl, 
or functional groups such as CH2F, CHFa, CF3, Si(CH3)3, Si(CH3)2(C2Hs), Sj{CH3)(C2H5)2, Si(C2H5)3. Yet another 
mass-modification can be obtained by attaching homo- or heteropeptldes through the nucleic acid molecule (e.g. , 
detector (D)) or nucleoside triphosphates. One example, useful in generating mass-modified species with a mass 
increment of 57. is the attachment of oligoglycines, e.g. , mass-modifications of 74 (r-l , m=0), 131 (r=1 , m = 1), 188 
(r = 1 , m r: 2), 245 (r = 1 , m = 3) are achieved. Stmpie oligoamides also can be used, e.g. , mass -modifications of 74 
(r = 1, m = 0), 88 (r=2, m = 0), 1 02 (r=3, m = 0), 116(r-4, m = 0), etc. are obtainable. Variations in additions to those 
set forth herein will be apparent to the skilled artisan. 

[0208] Different mass-modified detector oligonucleotides can be used to simultaneously detect all possible variants/ 
mutants simultaneously (FIGURE 6B). Alternatively, allfourbase pemnutationsatthesite of a mutation can be detected 
by designing and positioning a detector oligonucleotide, so that tt serves as a primer for a DNA/RNA potymerase with 
varying combinations of elongating and terminating nucleoside triphosphates (FIGURE 6C). For example^ mass mod- 
ifications also can be incorporated during the amplification process. 

[0209] FIGU RE 3 shows a different multiplex detection fomriat, in which differentiation is accomplished by employing 
different specific capture sequences which are position-specifically immobilized on a flat surface (e^. , a 'chip array'), 
if different target sequences Tl-Tn are present, their target capture sites TCSI-TCSn will specifically interact with 
complementary immobilized capture sequences Cl-Cn. Detection is achieved by employing appropriateEy mass dif- 
ferentiated detector oligonucleotides D1-Dn, which are mass modifying functionalities Ml-Mn. 


40 Mass spectrometrlc methods for sequencing DNA 


[0210] Amenable mass spectrometrlc formats for use herein include the ionization (1) techniques, such as matrix 
assisted laser desorption ionization (MALDI), electrospray (ESI) (e.g. , continuous or pulsed); and related methods (e. 

lonspray, Thermospray, Fast Atomic Bombardment), and massive cluster impact (MCI); these ion sources can be 
matched with detection formats including lin -linear fields) time-of-flight (TOP), single or multiple quadrupole, single or 
multiple magnetic sector, Fourier transform ion cyclotron resonance (FTICR), ion trap, or combinations of these to give 
a hybrid detector (e,g. , ion trap - time of flight). For ionization, numerous matrix/wavelength combinations including 
frozen anatyte preparation (MALDI) or solvent combinations (ESI) can be employed. 

[0211] Since a nonnal DNA molecule includes four nucleotide units (A, T, C, G), and the mass of each of these is 
unique (monoisotopic masses 313.06, 304.05, 289.05, 329.05 Da, respectively), an accurate mass determination can 
define or constrain the possible base compositions of that DNA. Only above 4900 Da does each unit molecular weight 
have at least one allowable composition; among all 6-mers there is only one non-unique nom//?a/ molecular weight, 
among 8-mers, 20. For these and larger oligonucleotides, such mass overlaps can be resolved with the — 1/1 0^ 
1 0 part per miliion, ppm) mass accuracy available with high resolution FTICR MS. For the 25-mer AgTgo, the 20 com- 
position degeneracies when measured at ±0.5 Da is reduced to three (AgT^o, T4Ci£G9, AT3C4Gie) when measured 
with 2 ppm accuracy. Given composition constraints (e.g. , the presence or absence of one of the four bases in the 
strand) can reduce this further (see below). 

[0212] Medium resolution instrumentation, including but not exclusively curved field reflectron or delayed extraction 
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time-of-ftighl MS instruments, can also result in innproved DNA detection for sequencing or diagnostics. Either of these 
are capable of detecting a 9 Da (Am (A-T)) sl^ift in ^30-mer strands generated from, for exampie primer oiigo base 
extension (PROBE), or competitive oligonucleotide sin gie base extension (COSBE), sequencing, or direct detectron 
of smail ampiified products. 

BiomassScan 

[0213] In this embodiment, exempiified In Example 33, two single stranded nucleic acids are individually immobilized 
to solid supports. One support contains a nucleic acid encoding the wild type sequence whereas the other support 
contains a nucleic acid encoding a mutant target sequence. Total human genomic DNA is digested with one or more 
restriction endonuclease enzyme resulting in the production of small fragments of double stranded genomic DNA 
(10-1,000 bp). The digested DNA is incubated with the immobilized single stranded nucleic acids and the sample Is 
heated to denature the DNA duplex. The immobilized nucleic acid competes with the other genomic DNA strand for 
the complementary DNA strand and under the appropriate conditions, a portion of the compiementary DNA strand 
hybridizes to the immobilized nucleic acid resulting in a strand displacement. By using high stringency washing con- 
ditions, the two nucleic acids will remain as a DNA duplex only if there is exact identity between the immobilized nucleic 
add and the genomic DNA strand. The DNA that remains hybridized to the immobilized nucleic acid is analyzed by 
mass spectrometry and detection of a signal in the mass spectrum of the appropriate mass is diagnostic for the wild 
type or mutant allele. In this manner, total genomic DNA can be isolated from a biological sample and screened for 
the presence or absence of certain mutations. By immobilizing a variety of single stranded nucleic acids in an anray 
format, a panel of mutations may be simultaneously screened for a nunnber of genetic loci (i.e. , muFtiplexing). 
[0214] In addition, using less stringent washing conditions the hybridized DNA strand may be analyzed by mass 
spectrometry for changes in the mass resulting from a deletion or insertion within the targeted restriction endonuclease 
fragment. 

Primer oligonucleotide base extension 

[0215] As described in detail in the following Exampie 11 , the p rimer oligo base extension (PROBE) method combined 
with mass spectrometry identifies the exact number of repeat units (Le. the number of nucleotides in homogenous 
stretches) as well as second site mutations within a polymorphic region, which are otherwise only detectable by se- 
quencing. Thus, the PROBE technique increases the total number of detectable alleles at a distinct genomic site, 
leading to a higher polymorphism Infomnation content (PIC) and yielding a far more definitive identification in for instance 
statistics-based analyses in paternity orforenslcs applications. 

[0216] The method is based on the extension of a detection primer that anneals adjacent to a variable nucleotide 
tandem repeat (VNTR) or a polymorphic mononucleotide stretch using a DNA polymerase in the presence of a mixture 
of deoxyNTPs and those dideoxyNTPs that are not present in the deoxy form. The resulting products are evaluated 
and resolved by MALDI-TOFmass spectrometry with out further labeling of the DNA. In a simulated routine application 
with 28 unrelated individuals, the mass error of this procedure using external calibration was in the worst case 0.38% 
(56-mer), which is comparable to approximately 0.1 base accuracy; routine standard mass deviations are in the range 
of 0.1% (,03 bases). Such accuracy with conventional electrophoretic methods is not realistic, underscoring the value 
of PROBE and mass spectrometry in forensic medicine and paternity testing. 

[0217] The uttra-high resolution of Fourier Transform mass spectrometry maJ<es possible the simultaneous meas- 
urement of all reactions of a Sanger or Maxam Gilbert sequencing experiment, since the sequence may be read from 
mass differences instead of base counting from 4 tubes. 

[0218] Additionally, the mass differences between adjacent bases generated from unilateral degradation in a step- 
wise manner by an exonuclease can be used to read the entire sequence of fragments generated. Whereas UV or 
fluorescent measurements will not discriminate mixtures of the nucleoside/nucieotide which are generated when the 
exonuclease enzyme gets out of phase, this is no problem with mass spectrometry since the resolving power in differ- 
entiating between the molecular mass of dA, dT, dG and dC is more than significant. The mass of the adjacent bases 
(Le., nucleotides) can be determined, for example, using Fast Atomic Bombardment (FAB) or Electronspray ionization 
(ESI) mass spectrometry. 

[0219] New mutation screening over an entire amplified product can be achieved by searching for mass shifted 
fragments generated in an endonuclease digestion as described in detail in the following Examples 4 and 12, 
[0220] Partial sequence information obtained from tandem mass spectrometry (MS") can place composition con- 
straints as described in the preceding paragraph. For the 25-mer above, generation of two fragment ions formed by 
collisionally activated dissociation (CAD) which differ by 313 Da discounts T4C-,2G9. which contains no A nucleotides; 
confimning more than a single A eliminates AT3C4G-|g as a possible composition. 

[0221] MS" can also be used to detenmined full or partial sequences of larger DNAs; this can be used to detect, 
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locate, and identify new mutations in a given gene region. Enzymatic digest products whose masses are correct need 
not be furtlier analyzed; tliose with mass shifts could be isolated in reai time from the complex mixture in the mass 
spectrometer and partially sequenced to locate the new mutation. 

[0222] Table I describes the mutation/polymorphism detection tests that have been developed. 


Table 1 


Mutation/Polymorphism Detection Tests 

Clinical Association 

Gene 

Mutation/PoJymorph ism 

Cystic Fibrosis 

CFTR 

38 disease causing mutations in 14 exons/introns 

Heart nL<%paAp ^rihnl^^tpml 
MetaboNsm) 

Ann P 

Apo A-IV 
Apo B-1 00 

1 i^n, 1 loan, looo 
3478, 347T, 360H, 360Q 
3500Q, 3500R 

Thyroid Cancer 

RET proto- 
oncogene 

C634W, C634T, C634R, 
C634S, C634F 

Sickle Cell Anemia/ Thalassemia 

beta-globin 

Sickle cell anemia S and C 45 thalassemia alleles 

HIV Susceptibility 

CKR-5 

32bp deletion 

Breast Cancer Susceptibility 

BRCA-2 

2bp (AG) deletion in exon 2 

Thrombosis 

Factor V 

R506Q 

Arteriosclerosis 

Gpllla E-selectin 

L33P S12SR 

j Hypertension 

ACE 

l/D polymorphism 


Detection of mutations 


Diagnosis of genetic diseases 

[0223] The mass spectrometric processes described above can be used, for example, to diagnose any of the more 
than 3000 genetic diseases currently known ( e.g. , hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD), 
Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF)) or to be identified. 
[0224] The following Example 3 provides a mass spectrometric method for detecting a mutation (AF50B) of the cystic 
fibrosis transmembrane conductance regulator gene (CFTR), which differs by only three base pairs (900 daltons) from 
the wild type of CFTR gene. As described further in Example 3, the detection is based on a single-tube, competitive 
otigonucleotide single base extension (COSBE) reaction using a pair of primers with the 3'-terminal base complemen- 
tary to either the normal or mutant allele. Upon hybridization and addition of a polymerase and the nucleoside triphos- 
phate one base downstream, only those primers properly annealed (i..e, no 3'-temiinal mismatch) are extended; prod- 
Lfcts are resolved by molecular weight shifts as detennined by matrix assisted laser desorption ionization time-of-f light 
mass spectrometry. For the cystic fibrosis AF508 polymorphism, 28-mer 'normal' (N) and 30-mer 'mutant' (M) primers 
generate 29- and 31 -mersfor N and M homozygotes, respectively, and both for heterozygotes. Since primer and product 
molecular weights are relatively low (< 1 0 id3a) and the mass difference between these are at least that of a single - 
300 Da nucleotide unit, low resolution instrumentation is suitable for such measurements. 

[0225] Thermosequence cycle sequencing, as further described in Example 11 , is also useful for detecting a genetic 
disease. 

[0226] In addition to mutated genes, which result in genetic disease, certain birth defects are the result of chromo- 
somal abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's 
Syndrome), Monosomy X (Turner's Syndrome) and other sex chromosome aneuploidies such as Klienfelter's Syn- 
drome (XXY). Here, "ho use- keeping" genes encoded by the chromosome in question are present in different quantity 
and the different amount of an amplified fragment compared to the amount in a nomnal chromosomal configuration can 
be detemnined by mass spectrometry. 

[0227] Further, there Is growing evidence that certain DNA sequences may predispose an individual to any of a 
number of diseases such as diabetes, arteriosclerosis, obesity, various autoimmune diseases and cancer ( e.g. , color- 
ectal, breast, ovarian, lung). Also, the detection of "DNA fingerprints", e.g. , polymorphisms, such as "mini- and micro- 
satellite sequences", are useful for determining identity or heredity (e.g. , paternity or maternity). 
[0228] The following Examples 4 and 12 provide mass spectrometer based methods for identifying any of the three 
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different isoforms of human apo lipoprotein E, which are coded by the E2, E3 and E4 alleles. For example, the molecular 
weights of DNA fragments obtained after restriction with appropriate restriction endonucleases can be used to detect 
the presence of a mutation and/or a specific allele. 

[0229] Depending on the biological sample, the diagnosis for a genetic disease, chromosomaf aneuploidy or genetic 
predisposition can be preformed either pre- or post-natally. 

DEagnosIs of cancer 

[0230] Preferred mass spectrometer-based methods for providing an early indication of the existence of a tumor or 
a cancer are provide herein. For example, as described in Example 13, the telomeric repeat amplification protocol 
(TRAP) in conjunction with telomerase specific extensfon of a substrate primer and a subsequent amplification of the 
telomerase specific extension producte by an amplification step using a second primer complementary to the repeat 
structure was used to obtain extension ladders, that were easily detected by MALDI-TOF mass spectrometry as an 
indication of telomerase activity and therefor tumorigenesfs. 

[0231] Alternatlveiy, as described in Example 14, expression of a tumor or cancer associated gone ( e.g. . human 
tyrosine 5-hydroxylase) via FTT-PCR and analysis of the amplified products by mass spectrometry can be used to detect 
the tumor or cancer (e.g. , biosynthesis of catecholamine via tyrosine 5-hydroxylase is a characteristic of neuroblasto- 
ma). 

[0232] Further, a primer oligo base extension reaction and detection of products by mass spectrometry provides a 
rapid means for detecting the presence of oncogenes, such as the RET proto oncogene codon 634, which is related 
to causing multiple endocrine neopJasia, type II (MEN II), as described in Example 15. 

Diagnosis of Infection 

[0233] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are 
different from the sequences contained in the host celL Detecting or quantitating nucleic acid sequences that are 
specific to the Infectious organism is Important for diagnosing or monitoring infection. Examples of disease causing 
viruses that infect humans and animals and which may be detected by the disclosed processes include: Retroviridae 
(e^, human immunodeficiency viruses, such as HiV-1 (also referred to as HTLV-III, LAV or HTLV-III/LAV, see, e^, 
Ratner et al. (1 985) Nature 313 : 227-284; Wain-Hobson et al, (1 985) Cell 40:9-1 7); H I V-2 (see, Guyader et aL (1 987) 
Nature 328:662-669 European Patent Publication No. 0 269 520; Chakrabarti et aL (1987) Nature 328 :543-547: and 
European Patent Application No. 0 655 501); and other isolates, such as HIV-LP (international PCT application No. 
WO 94/00562 entitled VI Novet Human immunodeficiency Virus"] Picomavindae (e^, polio viruses, hepatitis A virus, 
(see, e^. Gust et al. (1 983) Intervirofogy 20:1 -7>: entero viruses, human coxsackieviruses, rhinoviruses, echoviruses); 
Ca/cMridae (e^, strains that cause gastroenteritis); Togaviridae ( e.g. , equine encephalitis viruses, rubella viruses); 
FIavindae(e^, dengue viruses, encephalitis viruses, yellow fever viruses); Coronavihdae (e.g.. coronavi ruses); Rhab- 
dovirldae (e^, vesicular stomatitis viruses, rabies viruses); FitoviridaB (e.g. . ebola viruses); Paramyxovindae (e.g.. 
parainfluenza viruses, mumps virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (e.g. . influenza vi^ 
ruses); Bungavtridae (e^, Hantaan viruses, bunga viruses, phleboviruses and Nairo viruses); Arena viridae (hemor- 
rhagic fever viruses); Reovfrfdae (e^, reoviruses. orbiviruses and rotaviruses); Birnaviridae, Hepadnaviridae(HepaX\X\s 
B virus); Parvovin'dae (parvoviruses); Papovaviridae (papiitoma viruses, polyoma viruses); Adenovin'dae (most aden- 
oviruses); Herpesviridae (herpes simplex virus (HSV) 1 and 2, varicella zoster virus, cytomegaovlrus (CMV), herpes 
viruses'); Poxvtridae (variola viruses, vaccinia viruses, pox viruses); and Indoviridae (e^, African swine fever virus); 
and unclassified viruses (e^, the etiological agents of Sponglfonn encephalopathies, the agent of delta hepatitis 
(thought to be a defective satellite of hepatitis B virus), the agents of non-A, non-B hepatitis (class 1 = internally trans- 
mitted; class 2 = parenteraily transmitted (i.e., Hepatitis C); Norwalk and related viruses, and astroviruses). 
[0234] Examples of infectious bacteria include, but are not limited to: i-telicobacter pyioris, Borefia burgdorferi, Le- 
gionefia pneumophiiia, Mycobacteria sps (e^, yw. tubercuiosis, M. avium, M. intraceiiufare, M. i<ansa/i, M. gordonae), 
Staphylococcus aureus. Neisseria gonorrhoeae. Neisseria meningitidis. Listeria monocytogenes. Streptococcus pyo~ 
genes (Group A Streptococcus), Streptococcus agaiactiae (Group B Streptococcus), Streptococcus (viridans group). 
Streptococcus faecatis, Streptococcus bovis, Streptococcus (anaerobic sps.). Streptococcus phhenumoniae, patho- 
genic Campylobacter sp., Enterococcussp., Haemophiius inffuenzae, Baciiius antracis, corynebacterium diphtheriae, 
corynebacteriumsp,, Erysipeiothrix rhusiopathiae, Ciostridtum perf ringers, Ciostridium tetani, Enterobacteraerogenes. 
Ktebsiefia pneumoniae, Pastureiia muitocida, Bacteroidsssp., Fusobacterium nucieatum, Streptobaciifusmaniliformis, 
Treponema paflidium, Treponema pertenue, Leptospira, and Actinomyces iaraeiti. 

[0235] Examples of infectious fungi include: Cryptococcus neoformans, Histopiasma capsuiatum, Coccidioides im- 
mitis, Biastomyces dermatitidis, Chiamydia trachomatis, Candida albicans. Other infectious organisms (i.e., protists) 
include: Ptasmodium faiciparum and Toxoplasma gondii. 
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[0236] The processes provided herein makes use of tlie l<nown sequence informatton of the target sequence and 
known nnutation sites. Although new mutattons can also be detected. For example, as shown in FIGURE 8, transcription 
of a nucleic acid nrtolecule obtained from a biological sample can be specifically digested using one or more nucleases 
and the fragments captured on a solid support carrying the corresponding complementary nucleic acid sequences. 
Detection of hybridization and the molecular weights of the captured target sequences provide information on whether 
and where in a gene a mutation is present. Alternatively, DNA can be cleaved by one or more specific endonucleases 
to form a mixture of fragments. Comparison of the molecular weights between wildtype and mutant fragment mixtures 
results in mutation detection. 

Sequencing by generation of specifically terndlnated fragements 

[0237] In another embodiment, an accurate sequence determination of a relatively large target nucleic acid, can be 
obtained by generating specificaJly terminated fragments from the target nucleic acid, determining the mass of each 
fragment by mass spectrometry and ordering the fragments to determine the sequence of the larger target nucleic 
acid. Jn a preferred embodiment, the specifically terminated fragments are partial or complete base-specifically termi- 
nated fragments. 

[0238] One method for generating base specifically tenninated fragments involves using a base-specific ribo nuclease 
after e^, a transcription reaction. Preferred base-specific ribonucleases are selected from among: T^-ribonuclease 
(G-specific), Ua-ribonuciease (A-specific), PhyM-ribonuclease U specific and ribonuclease A (U/C specific). Other ef- 
ficient and base-specific ribonucleases can be identified using the assay described in Example 21 . Preferably modified 
nucleotides are included in the transcription reaction with unmodified nucleotides. Most preferably, the modified nucle- 
otides and unmodified nucleotides are added to the transcription reaction at appropriate concentrations^ so that both 
moieties are incorporated at a preferential rate of about 1 :1 . Alternatively, two separate transcriptions of the target DNA 
sequence one with the modified and one with the unmodified nucleotides can be perfomned and the results compared. 
Preferred modified nucleotides include: boron or bromine modified nucleotides (Porter etal (1995) Biochemistry 34 : 
1 1 963-1 1 969; Hasan et aL (1 996) Nucl. Acids Res. 24:2 1 50-2 1 57; Li et al (1 995) Nucleic Acids Res. 23:4495-4501 ) , 
a-thio- modified nucleotides, as well as mass-modified nucleotides as described above. 

[0239] Another method for generating base specifically tenminated fragments involves performing a combined am- 
plification and base-specific temnination reaction. For example, a combined amplification and temiinatlon reaction can 
be performed using at least two different polymerase enzymes, each having a different affinity for the chain terminating 
nucleotide, so that polymerization by an enzyme with relatively low affinity for the chain terminating nucleotide leads 
to exponential amplification whereas an enzyme with relatively high affinity for the chain terminating nucleotide termi- 
nates the polymerization and yields sequencing products. 

[0240] The combined amplification and sequencing can be based on any amplification procedure that employs an 
enzyme with polynucleotide synthetic ability ( e.g. , polymerase). One preferred process^ based on the polymerase chain 
reaction (PGR), includes the following three thermal steps: 1) denaturing a double stranded (ds) DNA molecule at an 
appropriate temperature and for an appropriate period of time to obtain the two singie stranded (ss) DNA molecules 
(the template: sense and antisense strand); 2) contacting the template with at least one primer that hybridizes to at 
least one ss DNA template at an appropriate temperature and for an appropriate period of time to obtain a primer 
containing ss DNA template; 3) contacting the primer containing template at an appropriate temperature and for an 
appropriate period of time with; (i) a complete set of chain elongating nucleotides, (ii) at least one chain terminating 
nucleotide, (iii) a first DNA polymerase, which has a relatively low affinity towards the chain terminating nucleotide; 
and (iv) a second DNA polymerase, which has a relatively high affinity towards the chain terminating nucleotide. 
[0241] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired 
amount of ampirfied sequencing ladders. The quantity of the base specifically terminated fragment desired dictates 
how many cycles are perfomned. Although an increased number of cycles results fn an increased level of amplification, 
it may also detract from the sensitivity of a subsequent detection, it is therefore generally undesirable to perfomi more 
than about 50 cycles, and ts more preferable to perform less than about 40 cycles (e.g. , about 20-30 cycles). 
[0242] Another preferred process for simultaneously amplifying and chain temriinating a nucleic acid sequence Is 
based on strand displacement amplification (SDA) (see, e.g.. Walker etal. (1994) Nucl. Actds Res. 22 :2670-77; Euro- 
pean Patent Publication Number 0 684315 entitled "Strand Displacement Amplification Using Thermophilic Enzymes"). 
In essence, this process involves the following three steps, which altogether constitute a cycle: 1) denaturing a double 
stranded (ds) DNA molecule containing the sequence to be amplified at an appropriate temperature and for an appro- 
priate period of time to obtain the two single stranded (ss) DNA molecules (the template: sense and antisense strand); 
2) contacting the template with at least one primer (P), that contains a recognition/cleavage site for a restriction endo- 
nuclease (RE) and that hybridizes to at least one ss DNA template at an appropriate temperature and for an appropriate 
period of time to obtain a primer containing ss DNA template; 3) contacting the primer containing template at an ap- 
propriate temperature and for an appropriate period of time with (i) a complete set of chain elongating nucleotides; (ri) 
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at least one chain terminating nucleotide; (iii) a first DNA polymerase, which has a relatively low affinity towards the 
chain terminating nucleotide; (Iv) a second DNA polymerase, which has a relatively high affinity towards the chain 
terminating nucleotide; and (v) an RE that nicks the primer recognitionydeavage site. 

[0243] Steps 1)-3) can be sequentially pefformed for an appropriate number of times (cycles) to obtain the desired 
amount of amplified sequencing ladders. As with the PGR based process, the quantity of the base specifically termi- 
nated fragment desired dictates how many cycles are performed. Preferably, less than 50 cycles, more preferably less 
than about 40 cycles and most preferably about 20 to 30 cycles are performed. 

[0244] Preferably about 0.5 to about 3 units of polymerase is used in the combined amplification and chain termination 
reaction, Most preferably about 1 to 2 units is used. Particularly preferred polymerases for use in conjunction with PGR 
or other thermal amplification process are themnostable polymerases, such asTaq DNA polymerase (Boehringer Man- 
nheim), AmpliTaq FS DNA polymerase (Perkin-Elmer), Deep Vent (exo-), Vent, Vent (exo-) and Deep Vent DNA 
polymerases (New England Biolabs), Thermo Sequenase (Amersham) or exo[-) Pseudococcus furiosus (Pfu) DNA 
polymerase (Stratagene, Heidelberg, Germany). AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, and Pyrococcus furi- 
osus. In addition, preferably the polymerase does not have 5'-3' exonuclease activity, 

[0245] In addition to polymerases, which have a relativefy high and a relatively low affinity to the chain terminating 
nucleotide, a third polymerase, which has proofreading capacity (e.g., Pyrococcus woesei (Pwo)) DNA polymerase 
may also be added to the amplification mixture to enhance the fidelity of amplification. 

[0246] Yet another method for generating base specifically terminated fragments involves contacting an appropriate 
amount of the target nucleic acid with a specific endonuclease or exonuclease. Preferably, the original 5' and/or 3' end 
of the nucleic acid is tagged to faciMtate the ordering of fragments. Tagging of the 3' end is particularly preferred when 
in vitro nucleic acid transcripts are being analyzed, so that the influence of 3' heterogeneity, premature termination and 
nonspecific elongation can be minimized. 5' and 3' tags can be natural (e.g. , a 3' poly A tail or 5' or 3" heterogeneity) 
orartifictai. PreferredS' and/or3' tags are selected from among the molecules described for mass-modrfication above. 
[0247] The methods provided herein are further illustrated by the following examples, which should not be construed 
as limiting In any way. 

EXAMPLE 1 

MALDI-TOF desorption of oligonucleotides directly on solid supports 

[0248] 1 g CPG (Controlled Pore Glass) was functionalized with 3-(triethoxysiiyl)-epoxypropan to form OH-groups 
on the polymer surface. Astandard olfgonucleotide synthesis with 1 3 mg of the OH-CPG on a DNA synthesizer (Miliigen, 
Modef 7500) employing p-cyanoethyl-phosphoamidites (Koster et al. (1 994) Nucleic Acids Res. 12 :4539) and TAG N- 
protecting groups (Koster etaL (1 981) Tetrahedron 37:362) was performed to synthesize a S'-Tg-SOmer oligonucleotide 
sequence in which 50 nucleotides are complementary to a ""hypothetical" SOmer sequence. T5 serves as a spacer 
Deprotection with saturated ammonra in methanol at room temperature for 2 hours furnished according to the deter- 
mination of the DMT group CPG which contained about 1 0 umot 66mer/g CPG. This 55mer served as a template for 
hybridizations with a 26'mer (with 5"-DMT group) and a 40-mer (without DMT group). The reaction volume Is 100 \i\ 
and contains about 1 nmol CPG bound 55mer as template, an equimolar amount of oligonucleotide in solution {26-mer 
or 40-mer) In 20 mM Tris-HCI, pH 7.5, 10 mM MgClg and 25 mM NaCI. The mixture was heated for 10 mln at 65'C 
and cooled to 37*C during 30" (annealing). The oligonucleotide whfch has not been hybridized to the polymer-bound 
template were removed by cer>trifugation and three subsequent washing/centrifugation steps with 1 00 ui each of ice- 
cold 50 mM ammoniumcitrate. The beads were air-dried and mixed with matrix solution (3-hydroxypicolinic acid/1 OmM 
ammonium citrate in acetonitrlle/water, 1:1), and analyzed by MALDI-TOF mass spectrometry. The results are pre- 
sented in Figures 10 and 11. 

EXAMPLE 2 

Eiectrospray (ES) desorption and differentiation of an 18-mer and 19-mer 

[0249] DNA fragments at a concentration of 50 pmole/ul In 2-propanol/10mM ammontumcarbonate (1/9, v/v) were 
analyzed simultaneously by an eiectrospray mass spectrometer. 

[0250] The successful desorption and differentiation of an 18-mer and 19-mer by eiectrospray mass spectrometry 
is shown in FIGURE 12. 


29 


EP1 164 203 A2 


EXAMPLE 3 

Detection of The Cystic Fibrosis Mutation AF508, by single step dideoxy extension and analysis by MALDI- 
TOF mass spectrometry (Competitive Oligonucleotide Simple Base Extension = COSBE) 

[0251 ] The principte of the COSBE method is shown in F]GU RE 1 3, N being the normal and M the mutation detection 
primer, respectively. 

IMATERIALS AND METHODS 

[0252] PCR AmpUfication and Strand tmmobiUzation. Amplification was carried out with exon 10 specific primers 
using standard PCR condftions (30 cycles: 1'@95*C, 1'@55*C, 2'@72**C); the reverse primer was 5' labelled with 
biotin and column purifted (Olfgopurification Cartridge, Cruachem). After ampirfication the amplified products were 
purified by column separation (Qiagen Quickspin) and immobifized on streptavidin coated magnetic beads (Dynabeads, 
Dynal, Norway) according to their standard protocol; DNA was denatured using 0.1 M NaOH and washed with 0.1M 
NaOH, 1xB+W buffer and TE buffer to remove the non-bictinylated sense strand. 

[0253] COSBE Conditions. The beads containing ligated antisense strand were resuspended in 18^1 of Reaction 
mix 1 (2 ill 10X Taq buffer, 1 ^iL (1 unit) Taq Polymerase, 2 nL of 2 mM dGTP, and 13 HgO) and incubated at SO*C 
for 5' before the addition of Reaction mix 2 (100 ng each of COSBE primers). The temperature was reduced to 60'C 
and the mixtures incubaledfor a 5' annealing/extension period; the beads were then washed in 25mM triethyiammonium 
acetate (TEAA) followed by 50mM ammorium citrate. 

[0254] Primer Sequences. All primers were synthesized on a Perseptive Biosystems Expedite 8900 DNA Synthesizer 
usingconventional phosphoramidite chemistry (Sinha et aL (1 984) Nucleic Acids Res. 12 :4539). COSBE primers (each 
containing an intentional mismatch one base before the S'-temiinus) were those used in a previous ARMS study (Ferrie 
et aL (1992) Am J Hum Genet 51 :251-262> with the exception that two bases were removed from the 5'-end of the 
normal: 

ExIO PCR (Forward): 6'-BIO-GCAAGT GAATCCTGA GCGTG-3' (SEQ ID Mo. 1) 

ExIO PCR (Reverse): 5'-GTG TGA AGG GTT CAT ATG C-3' (SEQ ID No. 2) 

COSBE AF50B-N 5'-ATC TAT ATT CAT CAT AGG AAA CAC CAC A-3' (28-mer) (SEQ ID No. 3) 

COSBE AFSOa-N 5'-GTA TCT ATA TTC ATC ATA GGA AAC ACC ATT-3' (30-mer) (SEQ ID No. 4) 

[0255] Mass Spectrometry, After washing, beads were resuspended In 1 |dL 1 8 Mohm/cm HgO. 300 nL each of matrix 
(Wu et al. (1993) Rapid Commun. Mass Spectrom. 7:142-146) solution (0.7 M S-hydroxypicolinic acid, 0.7 M dibasic 
ammonium citrate in 1:1 HgOiCHaCN) and resuspended beads (Tang et al (1995) Rapid Commun Mass Spectrom B : 
727-730) were mixed on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk 
for introduction into the source region of an unmodffied Thermo Bioanalysis (fomnerfy Finnlgan) Visions 2000 MALDI- 
TOP operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical 
average molecular weigJits (Mr(calc)) were calculated from atomic compositions. Vendor provided software was used 
to determine peak certroids using external calibration; 1 .08 Da has been subtracted from these to correct for the charge 
carrying proton mass to yield the text M^(exp) values. 

[0256] Scheme. Upon annealing to the bound template, the N and M primers (8508.6 and 9148.0 Da, respectively) 
are presented with dGTP; only primers with proper Watson-Crick base paring at the variable (V) position are extended 
by the polymerase. Thus if V pairs with the S'-temninal base of N, N is extended to a 8837.9 Da product (N + 1 ). Likewise, 
If V is properly matched to the M tenninus, M is extended to a 9477.3 Da M + product. 

Results 

[0257] Figures 14 - 18 show the representative mass spectra of COSBE reaction products. Better results were ob- 
tained when amplified products were purified before the biotinylated anti-sense strand was bound. 

EXAMPLE 4 

Differentiation of IHuman Apollpoprotein E Isoforms by Mass Spectrometry 

[02581 Apo lipoprotein E (Apo E), a protein component of lipoproteins, plays an essential role in lipid metabolism. For 
example, It is involved with cholesterol transport, metabolism of lipoprotein particles, immunoregulation and activation 
of a number of lipolytic enzymes. 
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[0259] There are three common isoforms of human Apo E (coded by E2, E3 and E4 alleles). The most common js 
the E3 allele. The E2 allele has been shown to decrease the cholesterol level in plasma and therefore may have a 
protective effect against the development of atherosclerosis. The DNA encoding a portion of the E2 allele is set forth 
in SEQ ID No. 130. Finally, the E4 isoform has been correlated with increased levels of cholesterol, confemng predis- 
positionto atherosclerosis. Therefore, the identity of the apo E allele of a particularindividual is an important determinant 
of risk for the development of cardiovascular disease. 

[0260] As shown in Figure 1 9, a sample of DNA encoding apolipoprotein E can be obtained from a subject, amplified 
(e^, via PGR); and the amplified product can be digested using an appropriate enzyme (e.g., Cfot). The restriction 
digest obtained can then be analyzed by a variety of means. As shown in Figure 20, the three isotypes of apolipoprotein 
E (E2, E3 and E4 have different nucleic acid sequences and therefore also have dfstinguishabie molecular weight 
values. 

[0261] As shown in Figure 21 A-C, different Apolipoprotein E genotypes exhibit different restriction patterns in a 3.5% 
MetPhor Agarose Gel or 12% polyacrylamlde gel. As shown in Figures 22 and 23, the various apolipoprotein E geno- 
types can also be accurately and rapidly detemnined by mass spectrometty. 

EXAMPLE 5 

Detection of hepatitis B virus in serum samples. 
MATERIALS AND METHODS 

Sample preparation 

[0262] Phenol/cholofomn extraction of viral DNA and the final ethanol precipitation was done according to standard 
protocols. 

First PGR 

[0263] Each reaction was performed with 5|xl of the DNA preparation from serum. 15 pmol of each primer and 2 units 
Taq DNA polymerase (Perkin Elmer, Weiterstadt, Germany) were used. The final concentration of each dNTP was 
200ixMIV1, the final volume of the reaction was 50 1^1. lOx PGR buffer (Perkin Elmer, Weiterstadt, Germany) contained 
100 mlVI Tris-HCI, pH 8.3, 500 mM KCI, 15 mM MgClg, 0.01% gelatine (w/v). Primer sequences: 


Primer SEQUENCE S€Q ID No. 

1 S'-GCTTTGGGGCATGGACATTGACCCGTATAA 3' 5 

2 5'-CTGACTACTAATTCCCTGGATGCTGGGTCT-3' 6 

Nested PGR: 

[0264] Each reaction was performed either with 1 ^.1 of the first reaction or with a 1 :1 0 dilution of the first PGR as 
template, respectively 1 00 pmol of each primer, 2.5 u fYu(exO') DNA polymerase (Stratagene, Heidelberg, Germany), 
a final concentration of 200 of each dNTPs andS^il 1 0x fYu buffer (200 mM Tris-HCI, pH 8.75, 100 mM KCI, 100 
mlVI (NH4)2S04, 1% Triton X-100, 1 mg/ml BSA, (Stratagene, Heidelberg, Gennany) were used in a final volume 50 
ixl. The reactions were performed in a themnocycler (OmniGene, MWG-Blotech, Ebensberg, Germany) using the fol- 
lowing pnDgram: 92'C for 1 minute, 60*C for 1 minute and 72*'C for 1 minute with 20 cycles. Sequence of oligodeox- 
ynucleotldes (purchased HPLC-purified from MWG-Biolech, Ebersberg, Germany): 

H8V13: 5'-TTGCCTGAGTQCAGTATQQT-3' (SEQ ID NO. 7) 

HBVlSbio: Biotin-5'-AGCTCTATATCGGGAAGCCT-3' (SEQ ID NO. 8) 
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Purification of ampHfied products: 

[0265] For the recording of each spectrum, one PGR, 50 (performed as described above) was used. Purification 
was done according to tine foiiowing procedure: Uitraflltration was done using Ultrafree-MC fiitration units (Miliipore, 
Eschbom, Gemnany) according to the protocoi of tine provider with centrifugation at 8000 rpm for 20 minutes. 25^1 
{10|ig/|il) streptavidin Dynabeads (Dynaf, Hamburg, Germany) were prepared according to the instnjclions of the man- 
ufacturer and resuspended in 25p.i of B/W buffer (10 mM Tris-HCi, pH 7.5, 1mM EDTA, 2 M NaCI). This suspension 
was added to the PGR samples stiii in the filtration unit and the mixture was incubated with gentle shaking for 15 
minutes at ambient temperature. The suspension was transferred in a 1 .5 mi Eppendorf tube and the supernatant was 
removed with the aid of a Magnetic Particle Collector, MPC, (Dynal, Hamburg, Germany). The beads were washed 
twice with 50 |aI of 0.7 M ammonium citrate solution, pH 8.0 (the supernatant was removed each time using the MPC). 
Cleavage from the beads can be accomplished by using formamide at 90'C. The supernatant was dried in a speedvac 
for about an hour and resuspended in 4 jil of ultrapure water (MilliQ UP plus Miliipore, Eschborn, Germany). This 
preparation was used for MALDI-TOF MS analysis. 

MALDi-TOFMS: 

[0266] Half a microirter of the sample was pipetted onto the sample holder, then immediately mixed with 0.5 ^li matrix 
solution (0.7 M3-hydroxypicolinic acid 50% acetonitriJe, 70 mM ammonium citrate). This mixture was dried at ambient 
temperature and introduced into the mass spectrometer. All spectra were taken in positive ion mode using a Finnigan 
MAT Vision 2000 (Finnigan MAT, Bremen, Germany), equipped with a refiectron (5 keV ion source, 20 keV postaccel- 
eration) and a 337 nm nitrogen laser. Cailbration was done with a mixture of a 40-mer and a 100-mer, Each sample 
was measured with different laser energies. In the negative samples, the amplified product was detected neither with 
less nor with higher laser energies. In the positive samples the amplified product was detected at different places of 
the sample spot and also with varying laser energies. 

RESULTS 

[0267] A nested PGR system was used for the detection of HBV DNA in blood samples employing oligonucleotides 
complementary to the c region of the HBV genome (primer 1 : beginning at map positron 1 763, primer 2 beginning at 
map position 2032 of the complementary strand) encoding the HBV core antigen (HBVcAg). DNA was isolated from 
patients serum according to standard protocols. A first PGR was performed with the DNA from these preparations 
using a first set of primers. If HBV DNA was present in the sample a DNA fragment of 269 bp was generated. 
[0268] fn the second reaction, primers which were complementary to a region within the PGR fragment generated 
In the first PGR were used, if HBV related amplified products were present fn the first PGR a DMA fragment of 67 bp 
was generated (see Fig. 25A) in this nested PGR. The usage of a nested PGR system for detection provides a high 
sensitivity and also serves as a specificity control for the external PGR (Rolfs et af. (1992) PGR: Clinfcal Diagnostics 
and Research, Springer, Heidelberg). A further advantage is that the amount of fragments generated in the second 
PGR is high enough to ensure an unproblematic detection although purification losses can not be avoided. 
[0269] The samples were purified using ultrafiltration to restreptavidin Dynabeads. This puriffcation was done be- 
cause the shorter primer fragments were immobilized in higher yield on the beads due to stearic reasons. The immo- 
bilization was done directly on the uitraflltration membrane to avoid substance losses due to unspecific absorption on 
the membrane. Following immobilization, the beads were washed with ammonium citrate to perfomn cation exchange 
(Pieles et al (1993) Nucl. Acids Res. 21:3191-3196). The immobilized DNA was cleaved from the beads using 25% 
ammonia which allows cleavage of DNA from the beads in a very short time, but does not result in an introduction of 
sodium or other cations. 

[0270] The nested PCRs and the MALDt TOF analysis were performed without knowing the results of serological 
anafysis. Due to the unknown virus titer, each sample of the first PGR was used undiluted as template and in a 1 :10 
dilution, respectively. 

10271] Sampfe 1 was collected from a patient with chronic active HBV infection who was positive in Hbs- and Hbe- 
antigen tests but negative In a dot blot analysis. Sample 2 was a serum sample from a patient with an active HBV 
infection and a massive viremia who was HBV positive in a dot blot analysis. Sample 3 was a denatured serum sample 
therefore no serological analysis could be performed by an increased level of transaminases indicating liver disease 
was detected. In autoradiograph analysis (Figure 24), the first PGR of this sample was negative. Nevertheless, there 
was some evidence of HBV infection. This sample Is of interest for MALDI-TOF analysis, because it demonstrates that 
even low-level amounts of amplified products can be detected after the purification procedure. Sample 4 was from a 
patient who was cured of HBV infection. Samples 5 and 6 were collected from patients with a chronic active HBV 
infection. 
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[0272] Figure 24 shows the results of a PAGE analysis of the nested PGR reaction. A amplified product is clearly 
revealed in sannplea 1 , 2, 3, 5 and 6. In sannple 4 no ampiifted product was generated, it is indeed HBV negative, 
according to the serological analysis. Negative and positive controls are indicated by + and respectively. Amplification 
artifacts are visible In lanes 2, 5, 6 and + If non-diluted template was used. These artifacts were not generated if the 
template was used in a 1:10 dilution. In sample 3, ampliffed product was merely detectable if the template was not 
diluted. The results of PAGE analysis are in agreement with the data obtained by serological analysis except for sample 
3 as discussed above. 

[0273] Figure 26A shows a mass spectrum of a nested amplified product from sample number 1 generated and 
purified as described above. The signal at 20754 Da represents the single stranded amplified product (calculated: 
20735 Da, as the average mass of both strands of the amplified product cleaved from the beads). The mass difference 
of calculated and obtained mass is 1 9 Da (0.09%), As shown in Fig. 25A, sample number 1 generated a high amount 
of amplified product, resulting in an unambiguous detection. 

[0274] Fig. 25B shows a spectrum obtained from sample number 3, As depicted in Rg. 24, the amount of amplified 
product generated in this section is signlficantty lower than that from sample number 1 . Nevertheless, the ampliffed 
product is clearly revealed with a mass of 20751 Da (calculated 20735). The mass difference Is 16 Da (0.08%). The 
spectrum depicted m Fig. 25G was obtained from sample number 4 which is HBV negative (as is also shown in Fig 
24). As expected no signals corresponding to the amplified product could be detected. Alf samples shown in Fig. 25 
were analyzed with MALDI-TOF MS, whereby amplified product was detected in all HBV positive samples, but not In 
the H8V negative samples. These results were reproduced in several independent experiments. 

EXAMPLE 6 

Analysis of Ligase Chain Reaction Products Via MALDI-TOF Mass Spectrometry 
MATERIALS AND METHODS 

Ofigodeoxynudeotides 

[0275] Except the biotinylated one and all other oligonucleotides were synthesized in a 0.2 ^mol scale on a MiiliGen 
7500 DNA Synthesizer (Millipore, Bedford, MA. USA) using the p-cyanoethyiphosphoamidite method (Sinha, N.D, et 
aL (1984) Nucleic Acids Res. 12:4539-4577). The oligodeoxy nucleotides were RP-HPLC-pu rifled and deprotected 
according to standard protocols. The biotinylated oligodeoxynucleotide was purchased ( HP LC-pu rifled) from Biometra, 
Gottmgen, Germany). Sequences and calculated masses of the oligonucleotides used: 


Oligodeoxy^ SEQUENCE 5EQ ID 

nucleotide IMo, 

A 5 -p-TTGTGCCACGCGGTTGGGAATGTA (7521 Da) 9 

B 5'-p-AGCAACGACTQTTTGCCCGCCAGTTG (7948 Dal 10 

C 5'-biO-TACATTCCCAACCGCGTGGCACAAC (7960 Da) 1 1 

D 5'-p-AACTGGCGGGCAAACAGTCGTTGCT {7708 Da) 12 


5-Phosphorytation of oligonucfeotides A and D 

[0276] This was performed with polynucleotide kinase (Boehringer, Mannheim, Germany) according to published 
procedures, the S'-phosphorylated oligonucleotides were used unpurified for LCR. 

Ligase chain reaction 

[0277] The LCR was peiformed with Pfu DNA ligase and a ligase chain reaction kit (Stratagene, Heidelberg, Ger- 
many) containing two different pBluescript Kll phagemids. One carrying the wiJdtype form of the E.cofi /ac\ gene and 
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the other one a mutant of this gene with a single point mutation at bp 191 of the fac\ gene. 

[0278] The foilowing LCR conditions were used for each reaction: 100 pg tennpiate DNA (0.74 fnnol) with 500 pg 
sonified salmon spernn DNA as carrier, 25 ng (3.3 pmol) of each 5'-phospliorylated oiigonucieotide, 20 ng (2.5 pmoi) 
of each non-phosphorylated oiigonucieotide, 4 U Pfu DNA llgase in a final volume of 20 |il buffered ss 50-mer was 
used (I fmol) as tempiate, in this case oligo C was also biotinylated. All reactions were perfomied In a thermocycler 
(OmnlGene, MWG-Blotech, Ebersberg, Germany) with the following program: 4 minutes 92*C, 2 nninutes SO^C and 
25 cycles of 20 seconds 92*C, 40 seconds 60'C, Except for HPLC analysis the biotinylated ligation educt C was used. 
In a control experiment the biotinylated and non-blotinyfated oligonucleotides revealed the same gel electrophoretlc 
results. The reactions were analyzed on 7.5% polyacrylamide gels. Ligation product 1 (oil go A and B) calculated mass: 
15450 Da, ligation product 2 (ollgo C and D) calculated mass: 15387 Da. 

SMART'HPLC 

[0279] Ion exchange HPLC (1 E HPLC) was performed on the SMART-system (Pharmacia, Freiburg, Gemiany) using 
a Pharmacia Mono Q, PC 1 .6/5 column. Eluents were buffer A (25 mM Trls-HCl, 1 mM EDTA and 0.3 M NaCl at pH 
B.O) and buffer B (same as A, but 1 M NaCI). Starting with 1 00% A for 5 minutes at a flow rate of SOnl/min, a gradient 
was applied from 0 to 70% B in 30 minutes, then increased to 1 00% B in 2 minutes and held at 100% B for 5 minutes. 
Two pooled LCR volumes (40^1) perfonmed with either wlldtype or mutant template were Injected. 

Sample preparation for MALDf-TOF-MS 

[0280] Preparation of Immobilized DNA: For the recording of each spectrum two LCRs (performed as described 
above) were pooled and diluted 1:1 with 2x B/W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl). To the 
samples 5 jil streptavldln DynaBeads (Dynal, Hamburg, Gemiany) were added, the mixture was allowed to bind with 
gentle shaking for 15 minutes at ambient temperature. The supernatant was removed using a Magnetic Particle Col- 
lector, MPC, (Dynal, Hamburg, Gennany) and the beads were washed twice with 50 i^l of 0.7 M ammonium citrate 
solution (pH 8.0) (the supernatant was removed each time using the MPC). The beads were resuspended in 1 ^il of 
ultrapure water (MilllQ, Millipore, Bedford, Mabelow). 

[0281 ] Combination of ultrafiltration and streptavldln DynaBeads: For the recording of spectrum two LCRs (performed 
as described above) were pooled, diluted 1:1 with 2x BAN buffer and concentrated with a 5000 NMWL Ultrafree-MC 
filter unit (Milllpore, Eschborn, Germany) according to the Instructions of the manufacturer. After concentration the 
samples were washed with 300 |xl 1x B/W buffer to streptavldln DynaBeads were added. The beads were washed once 
on the LUtrafree-MC filtration unit with 300 jxl of 1x B/W buffer and processed as described above. The beads were 
resuspended in 30 to 50 ^il of 1x B/W buffer and transferred in a 1 .5 ml Eppendorf tube. The supernatant was removed 
and the beads were washed twice with 50 ^.1 of 0.7 M ammonium citrate (pH 8.0), Finally, the beads were washed once 
with 30 \l\ of acetone and resuspended in 1 pif of ultrapure water. The ligation mixture after Immobilization on the beads 
was used for MALDS-TOF-MS analysis as described below. 

MALDI-TOF-MS 

[0282] A suspension of streptavidin-coated magnetic beads with the Immobilized DNA was pipetted onto the sample 
holder, then immediately mixed with 0.5 ^l matrix solution (0.7 M 3-hydroxypicollnic acid In 50% acetonltrlie, 70 mM 
ammonium citrate). This mixture was dried at ambient temperature and Introduced into the mass spectrometer. All 
spectra were taken In positive Ion mode using a Finnigan MAT Vision 2000 (Finnlgan MAT, Bremen, Germany), 
equipped with a ref lectron (5 keV ion source, 20 keV postacceleration) and a nitrogen laser (337 nm). For the analysis 
of Pfu DNA llgase 0.5 \i\ of the solution was mixed on the sample holder with 1 jxl of matrix solution and prepared as 
described above. For the analysis of unpurlfled LCRs 1 jil of an LCR was mixed with 1 ^l matrix solution. 

RESULTS 

[0283] The E, coif tac\ gene served as a simple model system to Investigate the suitabiJlty of MALDI-TOF-MS as 
detection method for products generated in llgase chain reactions. This template system contains of an E. colt lad 
wlldtype gene in a pBIuescrlpt Kll phagemld and an E. coti/ad gene carrying a single point mutation at bp 191 (C to 
T transition; SEQ ID No. 131) In the same phagemid. Four different oligonucleotides were used, which were (Igated 
only if the E colt /ac/ wildtype gene was present (Figure 26). 

[0284] LCR conditions were optimized using Pfu DNA ligase to obtain at least 1 pmol ligation product in each positive 
reaction. The ligation reactions were analyzed by polyacrylamide gel electrophoresis (PAGE) and HPLC on the SMART 
system (Figures 27, 2B and 29). Figure 27 shows a PAGE of a positive LCR with wlldtype template (lane 1 ), a negative 
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LCR with mutant template (1 and 2) and a negative control which contains enzyme, oligonucleotides and no template 
but salmon spenn DNA. The gel electrophoresis clearly shows that the ligation product (50 bp) was produced only in 
the reaction with wildtype template; whereas neither the template carrying the point mutation nor the control reaction 
with salmon sperm DNA generated amplification products, in Figure 28, HPLC was used to analyze two pooled LCRs 
with wildtype template perfonned under the same conditions. The ligation product was clearly revealed. Figure 29 
shows the results of a HPLC in which two pooled negative LCRs with mutant template were analyzed. These chroma- 
tograms confirm the data shown in Figure 27 and the results taken together clearly demonstrate, that the system 
generates ligation products in a significant amount only if the wildtype template is provided. 

[0285] Appropriate control runs were performed to detenmrne retention times of the different compounds involved in 
the L OR experiments. These include the four oligonucleotides (A, B, C, and D), a synthetic ds 50-mer (with the same 
sequence as the ligation product), the wildtype template DNA, sonicated salmon spenn DNA and the Pfu DNA ligase 
in ligation buffer. 

[0286] In orderto test which purification procedure should be used before a LCR reaction can be analyzed by MALDI- 
TOF-MS, aliquots of an unpurified LCR (Figure 30A) and aliquots of the enzyme stock solution (Figure 30B) were 
analyzed with MALDI-TOF-MS. It turned out that appropriate sample preparation is absolutely necessary since all 
signals in the unpurified LCR correspond to signals obtained in the MALDI-TOF-MS analysis of the Pfu DMA ligase. 
The calculated mass values of oligo A and the ligation product are 7521 Da and 15450 Da, respectively. The data in 
Figure 30 show that the enzyme solution leads to mass signals which do interfere with the expected signals of the 
ligation educts and products and thenefore makes an unambiguous signal assignment impossible, Furthermore, the 
spectra showed signals of the detergent Tween20 being part of the enzyme storage buffer which influences the crys- 
tallization behavior of the analyte/matrix mixture in an unfavorable way. 

[0287] In one purification fonnat streptavi din-coated magnetic beads were used. As was shown in a recent paper, 
the direct desorption of DNA immobilized by Watson-Crick base pairing to a complementary DNA fragment covalentfy 
bound to the beads is possible and the non-biotiny fated strand will be desorbed exclusively (Tang et al. (1 995) Nucleic 
Acids Res. 23:3126-3131). This approach in using immobilized ds DNA ensures that only the norvblotinylated strand 
wilt be desorbed. if non-immobilized ds DNA is analyzed both strands are desorbed (Tang et al. (1994) Rapid Comm. 
Mass Soectrom. 7 183-186) leading to broad signals depending on the mass difference of the two single strands. 
Therefore, empfoying this system for LCR only the non-ligated oligonucleotide A, with a calculated mass of 7521 Da, 
and the ligation product from oligo A and oligo B (calculated mass: 15460 Da) will be desorbed if oligo C Is blotinylated 
at the 5'-end and immobilized on steptavidin-coated beads. This results in a simple and unambiguous identification of 
the LCR educts and products. 

[0288] Figure 31 A shows a MALDl-TOF mass spectrum obtained from two pooled LCRs (performed as described 
above) purified on streptavidin DynaBeads and desorbed directly from the beads showed that the purification method 
used was efficient (compared with Figure 30). A signal which represents the uniigated oligo A and a signal which 
corresponds to the ligation product could be detected. The agreement between the calculated and the experimentally 
found mass values is remarkable and allows an unambiguous peak assignment and accurate detection of the ligation 
product, in contrast, no ligation product but only oligo A could be detected in the spectrum obtained from two pooled 
LCRs with mutated template (Figure SIB). The specificity and selectivity of the LCR conditions and the sensitivity of 
the MALDl-TOF detection is further demonstrated when performing the ligation reaction in the absence of a specific 
template. Figure 32 shows a spectrum obtained from two pooled LCRs in which only salmon spenn DNA was used as 
a negative control, only oligo A could be detected, as expected. 

[0289] While the results shown in Figure 31 A can be con^elated to lane 1 of the gel in Figure 27, the spectrum shown 
in Figure 31 B is equivalent to lane 2 in Figure 27, and finally also the spectrum in Figure 32 corresponds to fane 3 in 
Figure 27. The results are In congruence with the HPLC analysis presented in Figures 28 and 29. While gel electro- 
phoresis (Figure 27) and HPLC (Figures 28 and 29) reveal either an excess or almost equal amounts of ligation product 
over ligation educts, the analysis by MALDl-TOF mass spectrometry produces a smaller signal for the ligation product 
(Figure 31 A), 

[0290] The lower intensity of the ligation product signal could be due to different desorption/ionization efficiencies 
between 24- and a 50-mer. Since the T^, value of a duplex with 50 compared to 24 base pairs is significantly higher, 
more 24-mer could be desorbed. A reduction in signal intensity can also result from a higher degree of fragmentation 
In case of the longer oligonucleotides. 

[0291] Regardless of the purification with streptavidin DynaBeads, Figure 32 reveals traces of Tween20 in the region 
around 2000 Da. Substances with a viscous consistence, negatively influence the process of crystallization and there- 
fore can be detrimental to mass spectrometer analysis. Tween20 and also glycerol which are part of enzyme storage 
buffers therefore should be removed entirely prior to mass spectrometer analysis. For this reason an improved purifi- 
cation procedure which Includes an additional ultrafiltration step prior to treatment with DynaBeads was investigated. 
Indeed, this sample purification resulted in a significant improvement of MALDl-TOF mass spectrometric performance. 
[0292] Figure 33 shows spectra obtained from two pooled positive (Fig. 33A) and negative (Fig. 338) LCRs, respec- 
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lively. The positive reaction was performed with a chemicaiiy synthesized, singie strand 50mer as template with a 
sequence equivafent to the ligation product of oiigo C and D. Oltgo C was 5'-biotlnylated. Therefore the template was 
not detected. As expected, only the ligation product of Oligo A and B (calculated mass 1 5450 Da) could be desorbed 
from the immobilized and ligaled oligo C and D. This newly generated DNA fragment is represented by the mass signal 
of 15448 Da in Figure 33A. Compared to Figure 32 A, this spectrum clearly shows that this method of sample preparation 
produces signals with improved resolution and intensity, 

EXAMPLE? 

Mutation detection by solfd phase oligo base extension of a primer and analysis by MALDi-TOF mass 
spectrometry (Primer Oligo Base Extension = Probe) 

Summary 

[0293] The soJid-phase origo base extension method detects point mutations and small deletions as well as small 
insertions in amplified DNA. The method is based on the extension of a detection primer that anneals adjacent to a 
variable nucleotide position on an affinity-captured amplified template, using a DNA polymerase, a mixture of three 
dNTPs, and the missing one dideoxy nucfeotide. The resulting products are evaluated and resolved by MALDI-TOF 
mass spectrometry without further labeling procedures. The aim of the following experiment was to detennine mutant 
and wildtype alleles in a fast and reliable manner. 

Description of the experiment 

[0294] The method used a single detection primer foUowed by a oligonucleotide extension step to give products 
differing in length by some bases specific for mutant or wildtype aJleies which can be easily resolved by MALDI-TOF 
mass spectrometry. The method is described by using as example the exon 10 of the CFTR-gene. Exon 10 of this 
gene bears the most common mutation in many ethnic groups (AF508) that leads in the homozygous state to the clinical 
phenotype of cystic fibrosis. 

MATERIALS AND METHODS 

Genomic DNA 

[0295] Genomic DNA were obtained from healthy individuals, individuals homozygous or heterozygous for the AF50a 
mutation, and one individual heterozygous for the 1506S mutation. The wfldtype and mutant alleles were confirmed 
by standard Sanger sequencing. 

PCH amplification of exon 10 of the CFTR gene 

[0296] The primers for PGR amplificatfon were CFEx10-F {5-GCAAGTGAATCCTGAGCGTG-3' (SEQ ID No. 13) 
located in intron 9 and biotinylated) and CFExlO-R (5'-GTGTGAAGGGCGTG-3' SEQ ID No. 14) located in intron 10). 
Primers were used in a concentration of 8 pmol. Taq-polymerase including 1 0x buffer were purchased from Boehringer- 
Mannheim anddTNPs were obtained from Pharmacia. The total reaction volume was 50^1. Cycling conditions for PGR 
were initially 5 min. at 95»C. followed by 1 min. at 94*C, 45 sec at 53^*0, and 30 sec at 72^0 for 40 cycles with a final 
extension time of 5 min at 72'*C- 

Purification of the amplified products 

[0297] Amplification products were purified by using Qiagen's PGR purification kit (No. 28106) according to manu- 
facturer's instructions. The elution of the purified products from the column was done in 60 ^\ TE-buffer (1 OmM Tris, 1 
mM EDTA, pH 7,5). 

Affinity-capture and denaturation of the double stranded DNA 

[0298] 1 0 i^L aliquots of the purified amplified product were transferred to one well of a streptavidin-coated microtiter 
plate (No. 1645684 Boehringer-Mannheim or No, 95029262 Labsystems). Subsequently, 10 ^il Incubation buffer (80 
mM sodium phosphate, 400 mM NaCI, 0,4% Tween20, pH 7,5) and 30 ^li water were added. After incubation for 1 hour 
at room temperature the welJs were washed three times with 200^1 washing buffer (40 mM Tris, 1 mM EDTA, 50 mM 
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NaCI, 0.1% Tween 20, pH B.B). To denature the double stranded DNA the wells were treated with 100 jil of a 50 mM 
NaOH solution for 3 min and the welts washed three times with 200 \i\ washing buffer 

Ofigo basB extension reaction 

[0299] The annea^ng of 25 pmol detection primer (CF508: 5'-CTATATTCATCATAGGAAACACCA-3' (SEQ ID No. 
1 5) was perfornned in 50 jii annealing buffer (20 mM Tris, 10 mM KCI, 1 0 nnM (NH4)2S04, 2 mM MgS02, 1 % Triton X- 
1 00, pH 8) at 50**C for 1 0 min. The wells were washed three times with 200 |i.l washing buffer and once in 200 \i\ TE 
buffer. The extension reaction was performed by using some components of the DNA sequencing kit from USB (No. 
70770) and dNTPs or ddNTPs from Phamnacia. The total reaction volume was 45 jxl, containing of 21 ^1 water, 6 \l\ 
Sequenase-buffer, 3 |xl 10 mM DTT solution, 4,5 jxl, 0,5 mM of three dNTPs, 4,5 ^il, 2 mM the missing oneddNTP, 5,5 
111 glycerol enzyme dilution buffer, 0,25 |^l Sequenase 2.0, and 0,25 pyrophosphatase. The reaction was pipetted on 
ice and then incubated for 15 min at room temperature and for 5 min at 37**C. Hence, the wells were washed three 
times with 200 \l\ washing buffer and once with 60 \l\ of a 70 mM NH4-Citrate solution. 

Denaturation and precipitation of the extended primer 

[0300] The extended primer was denatured in 50 ^1 1 0%-DMSO (dimethylsufoxide) in water at 80'C for 1 0 min. For 
precipitation, 10 nl NH^-Acetale (pH 6.5), 0,5 m,I glycogen (10 mg/ml water, Sigma No, G1766), and 100 [i\ absolute 
ethanof were added to the supernatant and incubated for 1 hour at room temperature. After centrifugation at 13.000 g 
for 10 min the peJIet was washed in 70% ethanol and resuspended in 1 |il 18 Mohm/cm HgO water. 

Sampie preparation and analysis on MALDI-TOF mass spectrometry 

[0301] Sample preparation was performed by mixing 0,3 |il of each of matrix solution (0.7 M 3-hydroxypicollnic acid, 
0.07 M dibasic ammonium citrate in 1:1 HgOiCHaCN) and of resuspended DNA/glycogen pellet on a sample target 
and allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of 
an unmodified Thermo Bioanalysis (fonmerly Finnigan) Visions 2000 MALDI-TOF operated in refiectron mode with 5 
and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M^Ccalc)) were 
calculated from atomic compositions; reported experimental Mr (Mr(exp)) vaiues are those of the singly-protonated 
form, determined using external calibration. 

RESULTS 

[0302] The aim of the experiment was to develop a fast and reliable method independent of exact stringencies for 
mutation detection that loads to high quality and high throughput in the diagnosis of genetic diseases. Therefore a 
special kind of ONA sequencing (oligo base extension of one mutation detection primer) was combined with the eval- 
uation of the resulting mini-sequencing products by matrix-assisted laser desorption ionization (MALDI) mass spec- 
trometry (MS). The time-of-flight (TOP) refiectron arrangement was chosen as a possible mass measurement system. 
To prove this hypothesis, the examination was perfomied with exon 10 of the CFTR-gene, in which some mutations 
could lead to the clinical phenotype of cystic fibrosis, the most common monogenetic disease in the Caucasian pop- 
ulation. 

[0303] The schematic presentation as given in Figure 34 shows the expected short sequencing products with the 
theoretically calculated molecular mass of the wildtype and various mutations of exon 10 of the CFTR-gene (SEQ ID 
No. 132). The short sequencing products were produced using either ddTTP (Figure 34A; SEQ ID Nos. 133-135) or 
ddCTP (Figure 34B; SEQ ID Nos. 1 36-1 39) to introduce a definitive sequence refated stop in the nascent DNA strand. 
The MALDI-TOF-MS spectra of healthy, mutation heterozygous, and mutation homozygous individuals are presented 
in Figure 35. All samples were confirmed by standard Sanger sequencing which showed no discrepancy in comparison 
to the mass spec analysis. The accuracy of the experimental measurements of the various molecutar masses was 
within a range of minus 21 .8 and plus 87.1 dalton (Da) to the range expected. This allows a definitive interpretation of 
the results in each case, A further advantage of this procedure is the unambiguous detection of the AI507 mutation. 
In the ddTTP reaction , the wildtype allele would be detected, whereas in the ddCTP reaction the three base pair deletion 
would be disclosed. 

[0304] The method described is highly suitable for the detection of single point mutations or microlesions of DNA. 
Careful choice of the mutation detection primers will open the window of multiplexing and lead to a high throughput 
including high quality in genetic diagnosis without any need for exact stringencies necessary in comparable allele- 
specific procedures. Because of the uniqueness of the genetic information, the oligo base extension of mutation de- 
tection primer is applicable in each disease gene or pofymorphic region in the genome like variable number of tandem 
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repeats (VNTR) or other single nucleotide polymorphisms (e.g. , apollpoprotein E gene), as also described here. 
EXAMPLE 8 

Detection of Polymerase Chain Reaction Pioducts Containing 7-Deazapurine Moieties with Matrix-Assisted 
l^ser Desorption/ionlzation Time-of-Fllght (MALDI-TOF) Mass Spectrometry 

MATERIALS AND METHODS 

Nucleic add amptifications 

[0305] The following ollgodeoxynucleotid© primers were either synthesized according to standard phosphoamidite 
chemistry (Sinha, N.D,. etaK. (1983) Tefraftec/ron Z.ef. Vol, 24, Pp. 5843-5846; SInha, N.D., etaL, (1984) Nucieic Acids 
Hes., Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Milllpore, Bedford, MA. USA) in 200 nmol scales 
or purchased from MWG-Blotech (Ebersberg, Gemriany, primer 3) and Biometra (Goettingen, Germany, primers 6-7). 


primer 1 
primer 2 
primer 3 
primer 4 
primer 5 
primer 6 


5'-GTCACCCTCGACCTGCAG (SEQ ID NO, 16); 
5'-TTQTAAAACGACGGCCAGT (SEQ ID NO. 17); 
5'-CTTCCACCGCGATGTTGA (SEQ ID NO. 18); 
5'-CAGGAAACAGCTATGAC (SEQ ID NO, 19); 
5'-QTAAAACGACGGCCAGT (SEQ ID NO. 20); 
5''GTCACCCTCGACCTGCAgC (g: RiboG) {SEQ ID NO. 21 >; 


primer 7; 5'-GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ ID NO. 
22); 

[0306] The 99-mer (SEQ ID No. 141) and 200-mer DNA strands (SEQ iD No. 140; modified and unmodified) as well 
as the ribo- and 7-deaza-modified 1 00-mer were amplified from pRFcl DNA (10 ng, generously supplied by S. Feyer- 
abend, University of Hamburg) In 100 \lL reaction volume containing 10 mmol/L KCI, 10 mmol/L (NH^jaSO^, 20 mmoU 
LTris HCI (pH 8.8), 2 mmol/L MgS04, (exo(0 Pseudococcus furiosus (Pfu) -Buffer, Pharmacia, Freiburg, Germany), 
0.2 mmol/L each dNTP (Pharmacia, Freiburg, Germany), 1 ^JLmol/L of each primer and 1 unit of exo(-) Pfu DNA polymer- 
ase (Stratagene, Heidelberg, Germany). For the 99'mer primers 1 and 2, for the 200-mer primers 1 and 3 and for the 
1 00-mer primers 6 and 7 were used. To obtain 7-deazapurfne modified nucteic acids, during PCR-amplification dATP 
and dGTP were replaced with 7-deaza-dATP and 7-deaza- dGTP. The reaction was performed in a thermal cycler 
(OmniGene, MWG-Blotech, Ebersberg, Gennany) usirrg the cycle: denaturation at 95*0 for 1 min., annealing at 51 *C 
for 1 min. and extension at 72'C for 1 min. For all PCRs the number of reaction cycles was 30. The reactton was 
allowed to extend for additional 1 0 min. at 72*0 after the last cycle. 

[0307] The 103-mer DNA strands (modified and unmodified; SEQ ID No. 245) were amplified from M13mp18 RFI 
DNA{100ng, Pharmacia, Freiburg, Germany) in 100 jiL reaction volume, using primers 4 and 5 all other concentrations 
were unchanged. The reaction was perfomned using the cycle: denaturation at 95*0 for 1 min., annealing at 40*0 for 
1 min. and extension at 72**C for 1 min. After 30 cycles for the unmodified and 40 cycJes for the modiffed lOS-mer 
respectively, the samples were Incubated for additional 1 0 min. at 72*0. 

Syntfiesis of S'-p^-PJ-lBbefed PCR-primers 

[0308] Primers 1 and 4 were 5'-p2.p].|abeieci employing T4-polynucleotidkinase (Epicentre Technologies) and 
(^32pj.ATP. (BLU/NGG/602A, Dupont, Germany) according to the protocols of the manufacturer The reactions were 
perfomned substituting 10% of primer 1 and 4 in PGR with the labeled primers under otherwise unchanged reaction- 
conditions. The amplified DNAs were separated by gel electrophoresis on a 1 0% poiyacryfamide gel. The appropriate 
bands were excised and counted on a Packard TRf-CARB 460C liquid scintNlation system (Packard, CT, USA). 
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Primer-cleavage from ribo-modified PCR-product 

[0309] The amplified DNA was purified using Uftrafree-MC fiiter units (30,000 NMWL), it was then redissolved in 100 
jil of 0.2 mol/L NaOH and heated at 95**C for 25 minutes. The soiution was then acidified with HC1 (1 mol/L) and further 
purified for MALDi-TOF analysis employing Uttrafree-MC filter units (10,000 NMWL) as described beiow. 

Purification of ampHfied products 

[031 0] All samples were purified and concentrated using Ultrafree-IVIC units 30000 NMWL (Millipore, Eschborn, Ger- 
many) according to the manufacturer's description. After lyophilization, amplified products were redissofved in 5 |iL (3 
laL for the 200-mer) of ultrapu re water This analyte solution was directly used for MALDI-TOF measurements. 

MALDi-TOF MS 

[031 1 ] Ailquots of 0.5 \iL of anaiyte solution and 0.5 jxL of matrix solution (0.7 mol/L 3-HPA and 0.07 mol/L ammonium 
citrate in acetonitrile/water (1:1, vA^)) were mixed on a flat metaiiic sample support, Afterdrying at ambient temperature 
the sample was introduced into the mass spectrometer for analysis. The MALDI-TOF mass spectrometer used was a 
Ffnnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany). Spectra were recorded in the positive ion reflector mode 
with a 5 keV ion source and 20 keV postacceleratfon. The instrument was equipped with a nitrogen laser (337 nm 
wavelength). The vacuum of the system was 3-4•^ O'® hPa in the analyzer region and 1 -4*1 0""^hPa in the source region. 
Spectra of modified and unmodffied DNA samples were obtained with the same relative laser power; external calib ratio 
was performed with a mixture of synthetic oligodeoxynucleotldes (7-to 50-mer). 

RESULTS AND DISCUSSION 

Enzymatic synthosis of J-deazapurme nucieotide contatr)ing nucieic acids by PGR 

[0312] In order to demonstrate the feasibility of MALDI-TOF MS for the rapid, gel-free analysis of short amplified 
products and to investigate the effect of 7"dea2apurine modification of nucleic acids under MALDI-TOF conditions, two 
different primer-template systems were used to synthesize DNA fragments. Sequences are displayed in Figures 36 
and 37. While the two single strands of the 103-mer amplified product had nearly equal masses (Am = 8 u), the two 
single strands of the 99-mer differed by 526 u. Considering that 7-deaza purine nucleotide building blocks for chemical 
DNA synthesis are approximately 160 times more expensive than regular ones (Product Information, Glen Research 
Corporation, Sterling, VA) and their application in standard p-cyano-phosphoamidite chemistry is not trivial (Product 
Information, Glen Research Corporation, Sterling, VA; Schneider etal, (1995) Nucl. Acids Res. 23:1570) the cost of 
7-deaza purine modified primers would be very high. Therefore, to increase the appiicablllty and scope of the method, 
all PCRs were performed using unmodified oligonucleotide primers which are routinely available. Substituting dATP 
and dGTP by c^-dATP and c^-dGTP in polymerase chain reaction led to products containing approximately 80% 7-dea- 
za-purine modified nucleosides for the 99-mer and 103-mer; and about 90% for the 200-mer, respectively. Table II 
shows the base composition of all PGR products. 


TABLE il: 


Base composition of the 99-mer, 103-mer and 200-mer PGR amplification products (unmodified and 7-deaza 

purine modified) 

DNA-fragments"* 

C 

T 

A 

G 

c^-deaza-A 

c'^-deaza-e 

ret. mod.2 

200-mers 

54 

34 

56 

56 




modified 200-mef 
s 

54 

34 

6 

5 

50 

51 

90% 

200-mer a 

56 

56 

34 

64 




modified 200-mer 
a 

56 

56 

3 

4 

31 

50 

92% 

103-mer s 

28 

23 

24 

28 





"s" and "a" describe "sense" and "antlsanse* strands of the double-stranded amplffted product. 


Indicates reJatlve modification as percentage of 7-deaza purine modified nucleotides of totaJ amount of purine nucleotides. 
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TABLE II: (continued) 


Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products (unmodified and 7-deaza 

purine modified) 

DNA-fragments'' 

C 

T 

A 

G 

c^-deaza-A 

c^-deaza-6 

rel. mod. 2 

modified 1 03-mer 
s 

26 

23 

6 

5 

18 

23 

79% 

103-iner a 

28 

24 

23 

2B 




modified 1 03-mer 
a 

28 

24 

7 

4 

16 

24 

78% 

99-mer s 

34 

21 

24 

20 




modified 99-mer s 

34 

21 

6 

5 

16 

15 

75% 

99-mer a 

20 

24 

21 

34 




modified 99-mer a 

20 

24 

3 

4 

18 

30 

87% 


"s" and "a" describe "sense" and "antlsonsa" strands of the double-stranded ampllfJed product. 


^ indicates relative modification as percentage of 7-<»eaza purine modified nucleotides of tola! amount of purine nucleotides. 

[0313] It remained to be determined whether 60-90% 7-deaza-purine modification is sufficient for accurate mass 
spectrometer detection. It was tfierefone important to determine whether all purine nucleotides could be substituted 
during the enzymatic amplification step, This was not trivial since it had been shown that c^-dATP cannot fully replace 
dATP in PCR if Tag DNA polymerase is employed (Seela, F. and A. Roeiltng (1992) Nucleic Acids Res., 20,55-61). 
Fortunately it was found that exo{-)Pfu DNA polymerase indeed could accept c'^-dATP and c^-dGTP in the absence of 
unmodified purine nucleoside triphosphates. The incorporation was Jess efficient feading to a lower yield of amplified 
product (Figure 38). 

[0314] To verify these results, the amplications with p2p].|abeJed primers were repeated. The autoradiogram (Figure 
39) clearly shows lower yields for the modified PCR-products. The bands were excised from the get and counted. For 
all amplified products the yield of the modified nucleic acids was about 50%, referring to the corresponding unmodified 
amplification product. Further experiments showed that exo(-)Deep l/enf and Vent DNA polymerase were able to in- 
corporate c^-dATP and c^-dGTP during PCR as weli. The overall performance, however, turned out to be bestforthe 
eyo{-)Pfu DNA polymerase giving least side products during amplification. Using all three polymerases, It was found 
that such PCRs employing c^-dATP and c'^-dGTP instead of their isosteres showed less side-reactions giving a cleaner 
PCR-product. Decreased occurrence of amplification side products may be explained by a reduction of primer mis- 
matches due to a ling template which is synthesized during PCR, Decreased melting point for DNA duplexes containing 
7-deaza-p urine have been described (Mizusawa, S. et aL, (I 986) Nucleic Acids Res,, 14, 1319-1324). in addition to 
the three polymerases specified above (exo(-) Deep Vent DNA polymerase, 5Vent DNA polymerase and exo(-) (Pfu) 
DNA polymerase), it is anticipated that other polymerases, such as the Large Klenow fragment of E.coN DNA polymer- 
ase, Sequenase, Taq DNA polymerase and U AmpNTaq DNA polymerase can be used. In addition, where RNA is the 
template, RNA polymerases, such as the SP6 or the T7 RNA polymerase, must be used. 

MALDf-TOF mass spectrometry of modified and unmodified amptift'ed products. 

[0315] The 99-mer, 1 03-mer and 200-mer amplified products were analyzed by MALDI-TOF MS. Based on past 
experience, it was known that the degree of depurination depends on the laserenergy used fordesorption and ionization 
of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to depurination was to be in- 
vestigated, an spectra were measured at the same relative laser energy. 

[0316] Figures 40a and 40b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In case 
of the modified 1 03-mer, fragmentation causes a broad (M + H)+ signal. The maximum of the peak is shifted to lower 
masses so that the assigned mass represents a mean value of (M + H)-^ signal and signals of fragmented ions, rather 
than the (M + H)+ signal itself. Although the modified 1 03-mer still contains about 20% A and G from the oligonucJeotide 
primers, it shows less fragmentation which Is featured by much more narrow and symmetric signals. Especially peak 
tailing on the lower mass side due to depurination, is substantlaNy reduced. Hence, the difference between measured 
and calculated mass is strongly reduced although It is still below the expected mass. For the unmodified sample a 
(M+H)+ signal of 31 670 was observed, which rs a 97 u or 0.3% difference to the calculated mass. While, in case of the 
modified sample this mass difference diminished to 1 0 u or 0.03% (31 71 3 u found, 31 723 u calculated)! These obser- 
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vations are verified by a significant increase in mass resoiution of the (M+H)+ signai of the two signal strands (n/Am 
67 as opposed to 1 8 for the unmodified sampie with Am = fuii width at haif maximum, fwhm). Because of the iow mass 
difference between the two singie strands (8 u) their individual signals were not resolved. 

[031 7] With the results of the 99 base pair DNA fragments the effects of increased mass resolution for 7-deazapurine 
containing DNA becomes even more evident. The two single strands in the unmodified sampie were not resolved even 
though the mass difference between the two strands of the amplified product was very high with 526 u due to unequal 
distribution of purines and pyrimidines (figure 41 a). Jn contrast to this, the modified DNA showed distinct peaks for the 
two single strands (figure 41 b) which demonstrates the superiority of this approach for the detemiination of molecular 
weights to gel electrophoretic methods even more profound. Although base line resolution was not obtained the indi- 
vidual masses were able to be assigned with an accuracy of 0.1%: Am = 27 u for the lighter (calc. mass = 30224 u) 
and Am = 14 u for the heavier strand (calc. mass = 30750 u). Again, it was found that the full width at half maximum 
was substantially decreased for the 7-deazapurine containing sample. 

[0318] In case the 99'mer and 1 03-mer, the 7-deazapurlne containing nucleic acids seem to give higher sensitivity 
despite the fact that they still contain about 20% unmodified purine nucleotides. To get comparable signal-to-noise 
ratio at similar intensities for the (M + H)+ signals, the unmodified 99-mer required 20 laser shots In contrast to 12 for 
the modified one and the 1 03-mer required 12 shots for the unmodified sample as opposed to three for the 7-dea2a- 
purine nucleoside-containing amplified product. 

[0319] Comparing the spectra of the modified and unmodified 200-mer amplicons, improved mass resolution was 
again found for the 7-deazapurine containing sample as well as increased signal intensities (Figures 42A and 42B). 
While the signai of the single strands predominates in the spectrum of the modified sample the DNA-duplex and dimers 
of the single strands gave the strongest signal for the unmodified sample. 

[0320] A complete 7-deaza purine modification of nucleic acids may be achieved either using modified pnmers in 
PGR or cleaving the unmodified primers from the partially modified amplified product. Since disadvantages are asso- 
ciated with modified primers, as described above, a lOO-mer was synthesized using primers with a ribo-modification. 
The primers were cleaved hydro iytically with NaOH according to a method developed earlier in our laboratory (Koester, 
h\.eiaL,ZPhysfoL Chem,, 359, 1570-1569). Figures 43A and 43B display the spectra of the amplified product before 
and after primer cleavage. Figure 43b shows that the hydrolysis was successful: The hydrolyzed amplified product as 
well as the two released primers could be detected together with a small signal from residual uncleaved 1 00-mer This 
procedure is especially useful for the MALDI-TOF analysis of very short PCR-products since the share of unmodified 
purines originating from the primer increases with decreasing length of the amplified sequence. 
[0321] The remarkable properties of 7-deazapurine modified nucleic acids can be explained by either more effective 
desorption and/or ionization, increased fon stability and/or a lower denatunation energy of the double stranded purine 
modified nucleic acid. The exchange of the N-7 for a methyJ group results in the loss of one acceptor for a hydrogen 
bond which influences the abifity of the nucleic acid to fonn secondary structures due to non -Watson -Crick base pairing 
(Seela, R and A. Kahne (1 987) Biochemistry, 26, 2232-223B.). In addition to this the aromatic system of 7-deazapurine 
has a lower electron density that weakens Watson -Crick base pairing resulting in a decreased melting point (Mizusawa. 
S. et aL, (1 986) Nucleic Acids Res., 14, 1 319-1324) of the double-strand. This effect may decrease the energy needed 
for denaturation of the duplex in the MALDI process. These aspects as well as the loss of a site which probably will 
carry a positive charge on the N-7 nitrogen renders the 7-deazapu ri ne modified nucleic acid less polar and may promote 
the effectiveness of desorption. 

[0322] Because of the absence of N-7 as proton acceptor and the decreased polarization of the C-N bond in 7-dea- 
zapurine nucleosides depurinatlon following the mechanisms established for hydrolysis in solution is prevented. Al- 
though a direct correlation of reactions in solution and in the gas phase is problematic, less fragmentation due to 
depurinatlon of the modified nucleic acids can be expected in the MALDI process. Depurinatlon may either be accom- 
panied by loss of charge which decreases the total yield of charged species or it may produce charged fragmentation 
products which decreases the intensity of the non fragmented molecular ion signal, 

[0323] The observation of increased sensitivity and decreased peak tailing of the (M+H)^ signals on the lower mass 
side due to decreased fragmentation of the 7-deazapurine containing samples indicate that the N-7 atom indeed is 
essential for the mechanism of depurinatlon in the MALDI-TOF process. In conclusion, 7-deazapurine containing nu- 
cleic acids show distinctly increased Ion-stability and sensitivity under MALDI-TOF conditions and therefore provide 
for higher mass accuracy and mass resolution. 
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EXAMPLE 9 

Solid Phase Sequencing and Mass Spectrometer Detection 
MATERIALS AND METHODS 

[0324] Oligonucleotides were purchasGd from Operon Technologies (Alameda, CA) in an unpurified form. Sequenc- 
ing reactions were perfomied on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0 
(Amersham, Arlington Heights, Illinois). 

Sequencing a 39-mer target 

Sequencing complex; 

[0325] 


SEQUENCE SEQ ID 

NO. 

5'-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A**),-3' 23 
5'-TGTACGTCACAACT 3' {PNA 16/DNAI 24 

[0326] In order to perform solid-phase DNA sequencing, template strand DNA11683 was 3'-biotinylated by terminal 
deoxynucleotldyl transferase. A 30 m-I reaction, containing 60 pmol of DNA116B3, 1.3 nmol of biotin 14-dATP (GIBCO 
BRL, Grand Island, NY). 30 units of terminal transferase (Amersham, Arlington Heights, jjlinois), and 1 x reaction buffer 
(supplied with enzyme), was incubated at 37"C for 1 hour, The reaction was stopped by heat Inactivation of the terminal 
transferase at 70**C for 1 0 min. The resulting product was desalted by passing through aTE-1 0 spin column (Ciontech). 
More than one molecules of biotin- 14-dATP could be added to the 3'-end of DNA116B3. The biottnylated DNA11683 
was incubated with 0.3 mg of Dynal streptavidin beads in 30 |il 1x binding and washing buffer at ambient temperature 
for 30 min. The beads were washed twice with TE and redissoived in 30 ^li TE, 10 \i\ aliquot (containing 0.1 mg of 
beads) was used for sequencing reactions. 

[0327] The 0.1 mg beads from previous step were resuspended in a 10 |il volume containing 2 ^iJ of 5x Sequenase 
buffer (200 mM Tris-HCJ, pH 7.5, 100 mM MgCia, and 250 mM NIaCI) from the Sequenase kit and 5 pmol of corre- 
sponding primer PNA 1 6/DNA. The annealing mixture was heated to 70'C and allowed to cool sfowly to room temper- 
ature over a 20-30 min time period. Then 1 |il0.1 M dithiothreitol solution. 1 jil Mn buffer (0.15 M sodium isocitrate and 
0. 1 M MgCi2), and 2 \i\ of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into fouraliquots 
of 3 \l\ each and mixed with termination mixes (each contains of 3 of the appropriate termination mix: 32 c7dATP, 
32 laM dCTP, 32 c7dGTP, 32 \iM dTTP and 3.2 |xM of one of the four ddTNPs, in 50 mM NaCI). The reaction 
mixtures were incubated at 37'C for 2 min. After the completion of extension, the beads were precipitated and the 
supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4"C. 

Sequencing a TS-mer target 

[0328] Sequencing complex: 

5'-AAGATCTGACCAGGGATTCGGTTAGCGTGACTQCTQCTQCTGCTGCT 
GCTeCTGQATQATCCGACGCATCAGATCTGQ-(A'')„-3'(SEQ ID NO. 25) 
(TNR.PLASM2) 

5'-CTGATGCGTCGQATCATC-3' (CMl) (SEQ ID NO. 26) 

[0329] The target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in 
previous section (sequencing a 39-mer target). 
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Sequencing a IS-mer target with partiaiiy dupiex probe 
[0330] Sequencing complex; 

^-F-GATGATCCGACGCATCACAGCTC^' {SEQ ID No. 27) 
^'-TCGGTTCCAAGAGCTGTGATGCGTCGGATCATC-b-=*' {SEQ ID No, 28) 

[0331] CM1 B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Nofway) by incubating 60 pmol of 
CN/I1B3B with 0,3 magnetic beads in SOjil 1 M NaCI and TE (1x binding and washing buffer) at room temperature for 
30 min. Tine beads were washed twice with TE and redissolved in 30 ^i! TE, 10 or 20 ^1 aliquot (containing 0.1 or 0.2 
mg of beads respectively) was used for sequencing reactions. 

[0332] The duplex was fonmed by annealing corresponding aliquot of beads from previous step with 10 pmol of 
DF11a5F (or 20 pmol of DFIIaSFfor 0.2 mg of beads) In a 9 |xl volume containing 2 |jil of 5x Sequenase buffer (200 
mM Tria-HCI, pH 7.5, 100 mM MgCig, and 250 mM NaCI) from the Sequenase kit. The annealing mixture was heated 
to 65"C and allowed to cool siowiy to 37*'C over a 20-30 min time period. The duplex primer was then mixed with 1 0 
pmol of TS1 0 (20 pmol of TS10 for 0.2 mg of beads) in 1 ^l volume, and the resulting mixture was further incubated 
at 37^*0 for 5 min, room temperature for 5-1 0 min. Then 1 ^1 0. 1 M dithiothreitol soJution, 1 ^1 Mn buffer (0.15 M sodium 
isocitrate and 0.1 M MnCIa), and 2 (i! of diluted Sequenase (3.25 units) were added. The reaction mixture was divided 
into four afiquots of 3 \l\ each and mixed with termination mixes (each contains of 4 |jlI of the appropriate termination 
mix: 16 \iM dATP, 16 dCTP. 16 dGTP, 16 jxM dTTP and 1 .6 of one of the four ddNTPs, in 50 mM NaCI). 
The reaction mixtures were incubated at room temperature for 5 min, and SZ^'C for 5 min. After the completion of 
extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 jil TE 
and kept at 4"C. An aliquot of 2 ^il (out of 20 from each tube was taken and mixed with 8 jxl of formamide, the 
resulting samples were denatured at 90-95*C for 5 min and 2 |al (out of 1 0 |il total) was applied to an ALF DNA sequencer 
(Phamnacia, Piscataway, NJ) using a 1 0% polyacrylamide gel containing 7 M urea and O.BxTBE. The remaining aliquot 
was used for MALDl-TOF MS analysis. 

MALDi sampie preparation and instrumentatton 

[0333] Before MALDI analysis, the sequencing ladder loaded magnetic beads were washed twice using 50 mM 
ammonium citrate and resuspended in 0.5 ^il pure water. The suspension was then loaded onto the sample target of 
the mass spectrometer and 0.5 ^1 of saturated matrix solution (3-hydroxypicolinic acid (HPA): ammonium citrate = 1 0: 
1 mole ratio in 50% acetonltrMe) was added. The mixture was allowed to dry prior to mass spectrometer analysis. 
[0334] The reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen. Gennany) was used for 
analysis. 5 kV was applied in the ion source and 20 kV was applied for postaccele ration. All spectra were taken in the 
positive ion mode and a nitrogen laser was used. Normally, each spectrum was averagedfor more than 100 shots and 
a standard 25-point smoothing was applied. 

RESULTS AND DISCUSSION 

Cor}ventionaf soitcf-phase sequencing 

[0335] In conventional sequencing methods, a primer is directly annealed to the template and then extended and 
terminated in a Sanger dideoxy sequencing. Normally, a biotinyrated primer is used and the sequencing ladders are 
captured by streptavidin-coated magnetic beads. After washing, the products are eluted from the beads using EDTA 
and formamide. Previous findings indteated that only the annealed strand of a dupiex is desorbed and the immobilized 
strand remains on the beads. Therefore, it is advantageous to immobiiize the template and anneal the primer. After 
the sequencing reaction and washing, the beads with the immobilized template and annealed sequencing iadder can 
be loaded directly onto the mass spectrometer target and mix with matrix. In MALDI, only the annealed sequencing 
ladder will be desorbed and ionized, and the Immobiiized template will remain on the target. 

[0336] A 39-mer template (SEQ ID No. 23) was first biotinylated at the 3'-end by adding biotln-1 4-dATP with terminal 
transferase. More than one biotin-1 4-dATP moiecule could be added by the enzyme. Since the template was immobi- 
lized and remained on the beads during MALDI, the number of b lot! n-1 4-dATP would not affect the mass spectra. A 
14-mer primer (SEQ ID No. 24) was used for the solid-state sequencing to generate DNA fragments 3-27 below (SEQ 
ID Nos, 142-1 66). MALDl-TOF mass spectra of the four sequencing ladders are shown in Figure 44 and the expected 
theoretical values are shown in Table III. 
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TABLE 111 



5'-rCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-{A*)„-3' 

h 

3 ' -TCAAC ACTGC ATGT- 5- 

r 

3'-ATCAACACTGCATGT-5' 

4 

3'^CATCAACACTGCATGT-5' 

5 

3'-AC ATCAACACTG C ATGT-5 ' 

6 

3'^AA C ATC AACACTG C ATGT-5 ' 

7 

3'-TAACATCAACACTQCATGT-5' 

3 

3'-ATAACATCAACACTGCATGT-5' 

9 

3'-G AT A AC ATCAAC ACTGC ATGT-5' 

10 

3'-GGATAACATCAACACTGCATGT-5' 

11 

3'-CGGATAACATCAACACTGCATGT 5' 

12 

3'-CCGGATAACATCAACACTGCATGT-5' 

13 

3 'CCCGG ATA AC ATCAACACTG CATG T- 5 ' 

14 

3' -TCCCG G ATAAC ATC A AC ACTG C ATGT- 5 ' 

15 

3'-GTCCCGGATAACATCAACACTGCATGT-5' 

16 

3' CGTCCCGGATAACATCAACACTGCATGT 5' 

17 

3'-ACGTCCCGGATAACATCAACACTGCATGT-5' 

18 

3'-CACGTCCCGGATAACATCAACACTGCATQT-5' 

19 

3'-CCACGTCCCGGATAACATCAACACTGCATGT'5' 

20 

3'-ACCACGTCCCGGATAACATCAACACTGCATGT-5' 

21 

3'-GACCACGTCCCGGATAACATCAACACTGCATGT-5' 

22 

3''GGACCACGTCCCQGATAACATCAACACTGCATGT-5' 

23 

3'-CGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 

24 

3'-CCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 

25 

3'-ACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' | 

26 

3'-G ACCGGACCACGTCCCGG AT AA€ATCAAC ACTGC ATGT-5' | 

27 

3'-AGACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' i 
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TABLE III (Continued) 



A-reaction 

C-reaction 

G-reaction 

T-reaction 






2. 

4223.8 

4223.8 

4223.8 

4223,8 

3. 

4621.1 






4809,2 



5, 

5133.4 




6- 

5434.6 




7. 




5737.8 

8. 

6051.1 




9. 



6379.2 


10. 



6704.4 




6995.6 



12. 


7284.8 



13. 


7574.0 



14. 




7878.2 

15. 



8207.4 




8495.6 




8808.8 




18. 


9097,0 



19. 


9386.2 



20. 

9699.4 




21 



10027,6 


22. 



10355.8 


23, 


10644.0 



24. 


10933,2 



25, 

1 1246.4 
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A-reactton 

C-reaction 

G-reactron T-reaction 

26. 



11574,6 j 

27, 

11886.8 




10 

[0337] The sequencing reaction produced a relatively homogenous ladder, and the full-length sequence was deter- 
nnined easily. One peak around 5150 appeared in all reactions are not identified. A possible explanation is that a small 
portion of the template formed some l<ind of secondary structure, such as a loop, which hindered sequenase extension. 
Mis-incorpo ration is of minor importance, since the Intensity of these peaks were much lowerthan that of the sequencing 
'J ladders. Although 7-deaza purines were used in the sequencing reaction, which could stabilize the N-glycosidic bond 
and prevent depurination, minor base iosses were still observed since the primer was not substituted by 7-dea2apu- 
rines. The full length ladder, with a ddA at the 3' end, appeared in the A reaction with an apparent mass of 11 899.8. A 
more intense peak of 12333 appeared In ail four reactions and is likely due to an addition of an extra nucleotide by the 
Sequenase enzyme. 

20 [0338] The same technique could be used to sequence longer DNA fragments. A 78-mer template containing a CTG 
repeat (SEQ ID No, 25) was 3'-biotinyiated by adding biotin-1 4-dATP with tenminai transferase. An 18-merprimer (SEQ 
ID No. 26) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer 
extension. The four reactions were washed and analyzed by MALDI-TOFMS as usual. An example of the G-reactlon 
is shown in Figure 45 (SEQ ID Nos. 167-220) and the expected sequencing ladder is shown in Table IV with theoretical 

^5 mass values for each ladder component. Ail sequencing peaks were well resolved except the last component (theo- 
retical value 20577.4) was indistinguishable from the background. Two neighboring sequencing peaks (a 62-mer and 
a 63-mer) were also separated indicating that such sequencing analysis could be applicable to longer templates. Again, 
an addition of an extra nucleotide by the Sequenase enzyme was observed in this spectrum. This addition is not 
template specific and appeared in allfour reactions which makes it easy to be Identified. Compared to the primer peak, 

30 the sequencing peaks were at much lower intensity tn the long template case. 
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TABLE IV Continued 



ddATP 

ddCTP 

ddQTP 

ddTTP 

■1 

5491 .6 

5491 ,6 

5491.6 

5491.6 

2. 


5764.8 



3. 

6078.0 




j\ 

4. 



6407.2 




6696.4 



6. 

7009,6 




7. 



7338.8 


8. 


7628.0 



9, 

7941,2 




10. 



8270.4 


1 1 - 


8559.6 

■ 


12. 

8872.8 




1 3. 



9202.0 




9491 .2 



1 C 

1 D, 

9804.4 



„ 

16. 



10133.6 


1 7. 


10422.88 



18. 

10736.0 




1 9, 



1 1065.2 


II • 





21. 

1 1667.6 




22. 



11996.8 


23. 


12286,0 



24. 

12599,2 
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25. 



12928.4 


26. 




13232.6 

27. 


13521.8 



28. 

13835.0 




29. 


14124.2 



30. 



14453.4 


31. 


14742.6 



32. 




1 5046.8 

33. 

15360.0 




34. 

15673.2 




35. 


15962.4 



36. 


16251.6 



37. 



16580.8 


38. 

16894.0 




39. 

17207.2 




40. 




17511.4 

41. 


17800.6 



42. 


18189.8 



43. 


18379.0 



44. 




18683.2 

45. 



19012.4 


46. 



19341.6 


47. 




19645.8 

48. 


19935.0 



49. 

20248.2 




50. 



20577.4 
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llsi. 

20890.6 




1 52. 




21194.4 

53. 


21484.0 



54. 




21788.2 

55. 




22092.4 


Sequencing using duplex DNA probes lor capturing and priming 

[0339] Dupiex DNA probes with single-stranded overhang have been demonstrated to be able to capture specific 
DNA tempfates ard also serve as primers for solid-state sequencing. The scheme is shown In Figure 46. Stacking 
Interactions between a duplex probe and a single-stranded template allow only a 5-base overhang to be sufficient for 
capturing. Based on this fomriat, a 5' ffuorescent-labeled 23-mer (S'-GAT GAT CCG ACG CAT CAC AGC TC-3') (SEQ 
ID No. 29) was annealed to a S'-biotinylated 1 B-mer {5'-GTG ATG CGT CGG ATC ATC-3') (SEQ ID No. 30), ieaving a 
5-base overhang. A 15-mer template (S'-TCG GTT CCA AGA GCT-3') (SEQ ID No. 31) was captured by the dupJex 
and sequencing reactions were performed by extension of the 5-base overhang, MALDI-TOF mass spectra of the 
reactions are shown in Figure 47A-D. All sequencing peaks were resolved although at relatively low Intensfties. The 
last peak fn each reaction is due to unspecific addition of one nucleotide to the full length extension product by the 
Sequenase enzyme. For comparison, the same products were run on a conventional DNA sequencer and a stacking 
fluorogram of the results is shown in Figure 48. As can be seen from the Figure, the mass spectra had the same pattern 
as the fluorogram with sequencing peaks at much lower intensity compared to the 23-mer primer 

EXAMPLE 10 

Thermo Sequenase Cycle Sequencing 
MATERIALS AND METHODS 

[0340] PCH ampmicatjon. Human leukocytic genomic DNA was used for PGR amplification. PGR primers to amplify 
a 209 bp fragment of the p-globin gene were the p2 fon^vard primer (5'-CAT TTG CTT CTG ACA CAA CTG-3' SEQ ID 
NO. 32) and the pi 1 reverse primer {5'-CTT CTC TGT CTC GAG ATG C-3' SEQ ID NO. 33). Taq polymerase and lOx 
buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Gemiany). The 
total reaction volume was 50 ^1 including 8 pmoi of each primer with approximately 200 ng of genomic DNA used as 
template and a final dNTP concentration of 200 ^M, PGR conditions were: 5 min at 94*C, followed by 40 cycles of 30 
sec at 94*0, 46 sec at 53*C, 30 sec at 72*C, and a final extension time of 2 min at 72'C. The generated amplified 
product was purified and concentrated {2x) with the Qiagen 'Qiaquick' PGR purification kit (#281 06) and stored in HgO. 
[0341] Cycle Sequencing. Sequencing ladders were generated by primer extension with Themio Sequenase™ -DNA 
Polymerase (Amersham LIFE Science. #E79000Y) under the following conditions: 7 pmoi of HPLC purified primer 
(CodS 12mer: 5'-TGC ACCTGA GTC-3' SEQ ID No. 34) were added to 6nl purified and concentrated amplified product 
(i.e. 12jal of the original amplified product), 2.5 units Thermo Sequenase and 2,6 ml Thermo Sequenase reaction buffer 
in a total volume of 25^1. The final nucleotide concentrations were 30jxM of the appropriate ddNTP (ddATP, ddCTP, 
ddGTP orddTTP; Pharmacia Biotech, #27-2045-01) and 210^lM of each dNTP (7-deaza-dATP. DCTP, 7-deaza-GTP 
dTTP; Pharmacia Biotech). 

[0342] Cycling conditions were: denaturation for 4 min at 94*C, followed by 35 cycles of 30 sec at 94*C, 30 sec at 
38*C, 30 sec at 55*C, and a final extension of 2 min at 72*C. 

[0343] Sample preparation and analysis by MALDI-TOF MS. After completion of the cycling program, the reaction 
volume was increased to 50p.l by addition of 2S\l\ HgO. Desalting was achieved by shaking 30|ii of ammonium saturated 
DOWEX (Fluka #44485) cation exchange beads with 50 ^1 of the analyte for 2 min at room temperature. The Dowex 
beads, purchased in the protonated fomn, were pre-treated with 2M NH4OH to convert them to the ammonium fonn, 
then washed with HgO until the supernatant was neutral, and finally put in 10 mM ammonium citrate for usage. After 
the cation exchange, DNA was purified and concentrated by ethanol precipitation by adding 5^1 3 M ammonium acetate 
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(pH 6.5), 0.5 ^1 glycogen (10 mg/mJ, Sigma), and 110 j^l absolute ethanol to the anaJyte and incubated at room tem- 
perature for 1 hour. After 12 min centrifugation at 20,000 X g the pellet was washed in 70% ethanol and resuspended 
in 1 III 18 Mohmycm H^O water 

[0344] For MALDI-TOF MS analysis 0,35 ^il of resuspended DNA was mixed with 0.35-1 ,3 ^1 matrix solution (0.7 M 
3-hydroxypicoJlnic acid (3-HPA), 0.07 M ammonium citrate in 1 :1 HaOiCHaCN) on a stainless steel sample target disk 
and allowed to air dry preceding spectrum acquisition using a Themno Btoanalysis Vision 2000 MALD(-TOF operated 
in refiectron mode with 5 and 20 kV on the target and conversion dynode. respectively. External calibration generated 
from eight pealts (3000-1 8000 Da) was used for all spectra. 

RESULTS 

[0345] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder generated from a biological am- 
plified product as template and a 12mer (5'-TGC ACC TGA CTC-3'(SEQ ID N0.34)) sequencing primer. The peal<s 
resulting from depurinations and peaks which are not related to the sequence are mari<ed by an asterisk. MALDI-TOF 
MS measurements were taken on a refiectron TOF MS. A.) Sequencing ladder stopped with ddATP; B.) Sequencing 
ladder stopped with ddCTP; C) Sequencing ladder stopped with ddGTP; D.) Sequencing ladder stopped with ddTTR 
[0346] FIGURE 50 shows a schematic representation of the sequencing ladder generated In Fig, 49 wfth the corre- 
sponding calculated molecular masses up to 40 bases after the primer (SEQ ID Nos 221-260). For the calculation the 
following masses were used: 3581 .4 Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for 
dCTP and 328.2 Da for 7-dea2a-dGTP 

[0347] FIGURE 51 shows the sequence of the amplified 209bp amplified product within the p-globin gene (SEQ ID 
No. 261), which was used as a template for sequencing. The sequences of the appropriate PGR primer and the location 
of the 12mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 after 
the primer. In a wildtype sequence this T would be replaced by an A. 

EXAMPLE 11 

MJcrosatelltte Analysis Using Primer OlJgo Base Extension (PROBE) and IWALDI-TOF iVIass Spectrometry 
SUMMARY 

[0348] The method uses a single detection primer followed by an oligonucleotide extension step to give products 
differing in length by a number of bases specific for the number of repeat units or for second site mutations within the 
repeated region, which can be easily resolved by MALDI-TOF mass spectrometry. The method is demonstrated using 
as a model system the AluVpA polymorphism in fntron 5 of the interferon-a receptor gene located on human chromo- 
some 21 , and the poly Ttract of the splice acceptorslte of intron 8 from the CFTR gene located on human chromosome 


MATERIALS AND METHODS 

[0349] Genomic DNA was obtained from 1 8 unrelated individuals and one family including of a mother, father, and 
three children. The repeated region was evaluated conventionally by denaturing gef electrophoresis and results ob- 
tained were confimied by standard Sanger sequencing, 

10350] The primers for PGR amplification (8 pmol each) were IFIMAR-IVS6-5*: (5'-TGC TTA CTT AAC CCA GTG TG- 
3' SEQ ID. N0.35) and IFNAR-IVS5-3'.2: (5'-CAC ACT ATG TAA TAG TAT GC-3" SEQ ID. N0.36) for a part of the 
intron 5 of the inteiteron-ct receptor gene, and CFEx9-F:(5'-GAA AAT ATC TGA CAA ACT CAT C-3' SEQ ID NO 37) 
{5'-biotinylated) and CFEx9-Rr(5'-CAT GGA CAC CAA ATT AAG TTC-3'SEQ ID. N0.38) for CFTR exon 9 With flanlting 
intron sequences of the CFTR gene. Taq -polymerase including 1 0x buffer were purchased from Boehringer-Mannheim 
and dNTPs were obtained from Phannacia. The total reaction volume was 50 ^1 . PGR conditions were 6 mIn at 94**C 
followed by 40 cycles of; 1 min at 94'C, 45 sec at 53'>C, and 30 sec at 72'C. and a final extension time of 5 min at 
[0361] Amplification products were purified using QIagen's PCR purification kit (No.281 06) according to manufac- 
turer's instructions. Purified products were eluted from the column in 50 jif TE-buffer (1 OmM Tris-HCI, 1 mM EDTA, pH 
7,5). 

A) Primer oligo base extension reaction (thermo cycling method 

[0352] CyclePROBE was perfonned with 5 pmof appropriate detection primer (IFN:5'-TGA GACTCT GTC TC-3'SEQ 
ID. N0.39) in a total volume of 25 ^il including I pmol purified template, 2 units Thermosequenase (Amersham Life 
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Science, Cat. #E79000Y) 2.5 jil Thermosequenase buffer, 25 iimol of each deoxynucleotide (7-deaza-dATP, dTTP, 
and in some experiments extra dCTP) and 100 ^mol of dideoxyguanine and in some experiments additional ddCTR 
Cycling conditions: initial denaturation 94*C for 5 min followed by 30 cycJes with 44'C annealing temperature for 30 
sec and 55'C extension temperature for 1 min. 

Primer oUgo base extension react/on (isothermaf method) 

[0353] 10 \x\ ailquots of the purified double-stranded amplified product (-3 pmol) were transferred to a streptavidin- 
coated microliter plate well (-16 pmol capacity per 50 ^il volume; No. 1645684 Boehrlnger-Mannheim), followed by 
addition of 1 0 ^1 incubation buffer (80 mM sodium phosphate, 400 mM NaCI. 0.4% Tween 20, pH 7.5) and 30 ^il water. 
After incubation for 1 hour at room temperature, the wells were washed three tfmes with 200 jj.1 washing buffer A (40 
mM Tris, 1 mM EDTA, 50 mM NaCI, 0.1% Tween 20. pH 8.3) and incubated with 100 ^li of 50 mM NaOH for 3 min to 
denature the double-stranded DNA, Finally, the wells were washed three times with 200 ]al 70 mM ammonium citrate 
solution. 

[0354] The annealing of 100 pmol detection primer (GFpT: 5'-TTC CCC AAA TCC CTG-3' SEQ ID NO. 40) was 
perfomned in 50 ^il annealing buffer (50 mM ammonium phosphate buffer, pH 7.0 and 1 00 mM ammonium chloride) at 
65**C for 2 min, at 37*0 for 10 min, and at room temperature for 10 min. The wells were washed three times with 200 
\x\ washing buffer B (40 mM Tris, 1 mM EDTA, 50 mM NH4CI, 0.1% Tween 20, pH 8.8) and once in 200 \x\ TE buffer. 
The extension reaction was performed using some components of the DNA sequencing kit from USB (No. 70770) and 
dNTPs or ddNTPs from Pharmacia. Total reaction volume was 45 ^tl, containing of 21 ^il water, 6 jj.1 Sequenase -buffer, 
3 ^il 100 mM DTT solution, 50 ^imol of 7-deaza-dATP, 20 ^imoi ddCTP, 6.5 ^1 glycerol enzyme dilution buffer, 0.26 ^1 
Sequenase 2.0. and 0.25 |il pyrophosphatase. The reaction was pipetted on ice and incubated for 15 min at room 
temperature and for 5 min at 37*0. Finally,the wells were washed three times with 200 \i\ washing buffer B. 
[0355] The extended primer was denatured from the template strand by heating at SO*C for 10 min in 50 ^li of a 50 
mM ammonium hydroxide solution, 

[0356] For precipitation, 10 |iil 3 M NH ^-acetate (pH 6.5), 0.5 \i\ glycogen (10 mg/mi water, Sigma, Cat.#G1765), 
and 1 1 0 ^1 absolute ethanol were added to the supernatant and incubated for 1 hour at room temperature. After cen- 
trif ugation at 1 3. 000 g for 1 0 min the pel let was washed in 70% ethanof and resuspended in 1 p.1 1 8 Mohm/cm HgO water. 
[0357] Sample preparation was performed by mixing 0.6 ^1 of matrix solution (0.7 M 3-hydroxypicolrnic acid, 0.07 M 
dibasic ammonium citrate in 1 :1 HgOiCHaCN) with 0.3 ^1 of resuspended DNA/glycogen pellet on a sample target and 
allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of a 
Themno Bioanalysis (fonnerfy Finnigan) Visions 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on 
the target and conversion dynode, respectively. Theoretical average molecular mass (Mr(calc)) were cateulated from 
atomic compositions; reported experimental M^ (Mr(exp)) values are those of the singly-protonated form, determined 
using external calibration. 


RESULTS 


[0358] The aim of the experiments was to develop a fast and reliable method for the exact determination of the 
number of repeat units in microsateflites or the length of a mononucleotide stretch including the potential to detect 
second site mutations within the polymorphic region. Therefore, a special kind of DNA sequencing (primer oligo base 
extension, PROBE) was combined with the evaluation of the resulting products by matrix-assisted laser desorption 
ionization (MALDI) mass spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was chosen-as a possible 
mass measurement system. As an initial feasfbifity study, an examination was performed first on an AluVpA repeat 
polymorphism located in Intron 5 of the human interferon-a receptor gene (cycfePROBE reaction) and second on the 
poly T tract located in intron 8 of the human CFTR gene (isothermal PROBE reaction). 

[0359] A schematic presentation of the cyclePROBE experiment for the AluVpA repeat polymorphism is given in 
Figure 52. The extension of the antisense strand (SEQ ID No. 262) was performed with the sense strand serving as 
the template. The detection primer is underlined. In a family study co-dominant segregation of the various alleles could 
be demonstrated by the elect rophoretic procedure as well as by the cyciePROBE method followed by mass spec 
analysis (Figure 53). Those alleles of the mother and child 2, for which direct electrophoresis of the amplified product 
indicated one of the two copies to have 13 repeat units,were measured using cyciePROBE to have instead only 11 
units using ddG as temiinator. The replacement of ddG by ddC resulted in a further unexpected short allele wfth a 
molecular mass of approximately 1 1 650 in the DNA of the mother and child 2 (Figure 54). Sequence analysis verified 
this presence of two second site mutations in the allele with 13 repeat units. The first is a C to T transition In the third 
repeat unit and the second mutation is a T to G transversion in the ninth repeat unit. Examination of 28 unrelated 
individuals shows that the 13 unit allele Is spliced into a nonnal allele and a truncated allele using cyciePROBE. Sta- 
tistical evaluation shows that the polymorphism is in Hardy- Weinberg equilibrium for both methods, however, using 
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cyclePROBE as detection method the polymorphism information content is increased to 0.734. 
[0360] PROBE was also used as an Isothermic method for the detection of the three common a\\e\es at the intron 8 
splice acceptor site of the CFTR gene (SEQ ID No. 263). Figure 55 shows a schematic presentation of the expected 
diagnostic products (SEQ ID Nos. 264-266) with the theoretical mass values. The reaction was also performed in the 
antisense direction. 

[0361] Figure 56 demonstrates that all three common alleles (T6, T7, and T9, respectively) at this locus coutd be 
reliably disclosed by this method. Reference to Figure 56 indicates that mass accuracy and precision with the refiectron 
time of flight used in this study ranged from 0-0.4%, with a relative standard deviation of 0.13%. This corresponds to 
far better than single base accuracy for the up to <90-mer diagnostic products generated in the IFNAR system. Such 
high analytical sensitivity Is sufficient to detect single or multiple insertion/deletion mutations within the repeat unit or 
its flanking regions, which would induce > 1 % mass shifts in a 90-men This is analogous to the Figure 56 polyT tract 
analysis. Other mutations (i.e. an A to T or a T to A mutation withm the IFNAR gene AST repeat) which do not cause 
premature product termination are not detectable using any dNTP/ddNTP combination with PROBE and low perform- 
ance MS Instrumentation; a 9 Da shift in a 90-mer corresponds to a 0.03% mass shift. Achieving the accuracy and 
precision required to detect such minor mass shifts has been demonstrated with higher performance instrumentation 
such as Fourier transfomn (FT)MS, for which single Da accuracy is obtained up to 100-mers. Further, tandem FTMS, 
in which a mass shifted fragment can be isolated within the instrument and dissociated to generate sequence specific 
fragments, has been demonstrated to locate point mutations to the base in comparably sized products. Thus the com- 
bination of PROBE with higher performance instrumentation will have an analytical sensitivity which can be matched 
only by cumbersome full sequencing of the repeat region. 

EXAMPLE 12 

Improved Apollpoprotein E Genotyping Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass 
Spectrometry 

MATERIALS AND METHODS 

PGR amptification. 

[0362] Human leukocytic genomic DNA from 100 anonymous individuals from a previously published study (Braun, 
A etaL, (1 992) Human Genet. 89 :401 -406) were screened for apollpoprotein E genotypes using conventional methods! 
PGR primers to amplify a portion of exon 4 of the apo E gene were delineated according to the published sequence 
(Das, HK et al,, (1985) J. Biol. Chem. 260 :6240-6247) (forward primer, apoE-F: 5''GGC ACG GCT GTC CAA GGA G- 
3'SEQ ID. N0.41 ; reverse, apoE-n: 5'-AGG CCG CGC TCG GCG CCC TC-3'SEQ ID. N0.42). Taq polymerase and 
1 0x buffer were purchased from Boehrlnger-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Gennany). 
The total reaction volume was 50 including 8 pmol of each primer and 1 0% DMSO (dimethylsulfoxide, Sigma) with 
approximately 200 ng of genomic DNA used as template. Solutions were heated to SO'C before the addition of 1U 
polymerase; PGR conditions were: 2 min at 94''C, followed by 40 cycles of 30 sec at 94'Cp 45 sec at 63'C^ 30 sec at 
72*C, and a final extension time of 2 min at 72*C. 

Restriction enzyme digestion and potyacrylamide etectrophoresis. 

[0363] Cfol and Rsal and reaction buffer L were purchased from Boehrlnger-Mannheim, and Hhal from Pharmacia 
(Freiburg, Gemiany). For Cfol alone and simultaneous Cfol/Rsal digestion, 20 pL of amplified products were diluted 
with 1 5 ^ll water and 4 pL Boehringer-Mannheim buffer L; after addition of 10 units of appropriate restriction enzyme 
(s) the samples were incubated for 60 min at 37*0. The procedure for simultaneous Hha(/Rsal digestion required first 
digestion by Rsai in buffer L for one hour followed by addition of NaCf (50 mM end concentration) and Hhal, and 
additional incubation for one hour. 20 pL of the restriction digest were analyzed on a 12% polyacrylamide gel as de- 
scribed elsewhere (Hixson (1990) J. Lipid Res, 31:545-548), Recognition sequences of Rsal and Cfol (Hhal) are GT/ 
AC and GCG/C, respectively; masses of expected digestion fragments from the 252-mer amplified product with Cfol 
alone and the simultaneous double digest wfth Cfol (or Hhal) and Rsal are given in Table V. 

Thermo-PROBE. 

[0364] PCR amplification was perfonned as described above, but with products purified with the Qiagen" Qiaquici<' 
kit to remove unincorporated primers. Multiplex Thermo-PROBE was performed with 35 ^1 amplified product and 8 
pmol each of the codon 1 1 2 (5'-GCG GAC ATG GAG GAG GTG-3* SEQ ID. NO,4a) and 1 58 (5'-GAT GCC GAT GAC 
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CTG CAG AAG-3' SEQ ID. NO .44) detection primers in 20^1 fncluding - 1 pmol purified blotinylated antisense template 
innmobiJized on streptavidin coated magnetic beads, 2.5 units Thermosequenase, 2^1 Therm osequenase buffer, 50 
I^M of each dNTP and 200 ^iM of ddXTP, with the base identhy of N and X as described in the text. Cyciing conditions 
were: denaturation (94»C, 30 sec) foiiowed by 30 cycles at 94'C (10 min) and 60^*0 (45 sec). 

Sample preparation and anaiysis by MALDf-TOF MS. 

[0365] For precipitation (Stuits etaL, ( 1 99 1 ) Rapid Common . Uass Spectrom . 5 : 359-363) of both digestsand PROBE 
products, S\l\ 3M ammonium acetate (pH 6.5), 0.5^i] glycogen (1 0 mg/mi, Sigma), and 110 ^i absolute ethanol were 
added to 50 ^1 of the anaiyte solutions and stored for 1 hour at room temperature. After 10 min centrifugatlon at 1 3,000 
X g the pellet was washed in 70% ethanol and resuspended in 1 piJ 1 8 IVIohm/cm H^O water. Where noted in the text, 
additional desalting was achieved by shaking 10-20^L of ammonium saturated DOWEX (Fluka #44485) cation ex- 
change beads in 40^L of anaiyte. The beads, purchased in the protonated form, were pre-treated with three 6 min 
spin-decant steps In 2M NH4OH, foiiowed with H2O and 10 mM ammonium citrate. 

[0366] 0.35|aL of resuspended DNA was mixed with 0.35-1 .3hL matrix solutions (Wu et al. (1 993) Rapid Commun. 
Mass Spectrom. 7:142^146) 0.7 M 3-hydroxypicolinrc acid (3-HPA), 0.07 M ammonium citrate in 1 :1 HaOiCHaCN) on 
a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bioanaiysis 
Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respec- 
tively. Theoretical average molecular masses (Mr(calc)) of the fragments were calculated from atomic compositions; 
the mass of a proton (1.08 Da) is subtracted from raw data values in reporting experimental moiecular masses (M^ 
(exp)) as neutral basis. An external calibration generated from eight peaks (3000-18000 Da) was appiled to all spectra. 

RESULTS 

Digestion with Cfol aione. 

[0367] The inset to Figure 57a shows a 12% polyacryiamide gel eiectrophoretic separation of an G3/e3 genotype 
after digestion ofthe252 bp apoE amplified product with Cfol. Comparison of the eiectrophoretic bands with a molecular 
weight ladder shows the cutting paUern to be as mostly as expected (Table V) for the G3/G3 genotype. Differences 
are that the faint band at approximately 25 bp is not expected, and the smallest fragments are not obsefved. The 
accompanying mass spectrum of precipitated digest products shows a similar pattern, albeit at higher resolution. Com- 
parison with Table Vshows that the observed masses are consistent with those of single-stranded DNA; the combination 
of an acidic matrix environment (3-HPA. pK^ 3) and the absorption of thermal energy via interactions with the 337 nm 
absorbing 3-HPA upon ionization is known to denature short stretches of dsDNA under nomial MALDl conditions (Tang, 

£t 31., (1994) Rapid Commun Mass Spectrom 8:183-186). 
[0368] The approximately 25-mers, unresolved with electrophoresis, are resolved by MS as three single stranded 
fragments; whiie the largest (7427 Da) of these may represent a doubly charged ion from the 14.8 kDa fragments (m 
= 1 4850. z = 2; m/z = 7425), the 671 5 and 71 53 Da fragments could result from PGR artifacts or primer impurities; all 
three peaks are not observed when amplified products are purified with Qiagen purification kits prior to digestion. The 
Tabie V 8871 Da 29-mer sense strand 3'-terminai fragment is not observed; the species detected at 91 86 Da is con- 
sistent with the addition of an extra base (9187 - 8871 ^ 316, consistent with A) by the Taq-poiymerase during PCR 
amplification (Hu. G et aL. (1993) DNA and Cell Biol 12:763-770). The individual single strands of each double strand 
with <35 bases (11 kDa) are resolved assingie peaks; the48-base single strands (Mr(calc) 14846 and 14858), however, 
are observed as an unresolved single peak at 14850 Da, Separating these into single peaks would require a mass 
resolution (nVf^m, the ratio of the mass to the peak width at half height) of 14850/13- 1140. nearly an orderof magnitude 
greater than what is routine with the standard reflectron time-of-fllght instrumentatfon used in this study; resolving such 
small mass differences with high perfonmance instrumentation such as Fourier transfonn MS, which' provides up to 
three orders of magnitude higher resolution in this mass range, has been demonstrated. The 91-mer single strands 
{Mr(calc)27849 and 28436) are also not resolved, even though this requires a resoiution of only < 50. The dramatic 
decrease in peak quality at higher masses is due to metastable fragmentation (i.e. depurination) resuiting from excess 
intemai energy absorbed during and subsequent to laser irradiation. 

Simultaneous digestion with Cfoi and Rsal. 

[0369] Figure 57b (inset) shows a 1 2% polyacryiamide gei electrophoresis separation of G3/G3 double digest prod- 
ucts, with bands consistent with dsDNA with 24, 31, 36, 48, and 55 base pairs, but not for the smaller fragments 
Although more peaks are generated (Table V) than with Cfol alone, the corresponding mass spectrum is more easiiy 
interpreted and reproducible since all fragments contain < 60 bases, a size range far more appropriate for MALDf-MS 
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if reasonably accurate values (e.g. , 0.1%) are desired. For fragments in this mass range, tlie mass measuring 
accuracy using externai calibration is -0.1% (i.e. < + 10 Da at 10 kDa). Significant depurination (indicated in Figure by 
asterisk) is observed for all peaks above 10 kDa, but even the largest peak at 17171 Da is clearly resolved from its 
depurination peak so that an accurate can be measured. Although molar concentrations of digest products should 
be identical, some discrimination against those fragments with <1 1 bases fs observed, probably due to their loss in the 
ethanoi/giycogen precipitation step. The quality of MS resuits from simultaneous digestion with Cfol (or Hhal) and Rsal 
is superior to those with Cfol (or Hhal) alone, since the smalier fragments generated are good for higher mass accuracy 
measurements, and with ail genotypes there is no possibility for dimer peaks overlapping with high mass diagnostic 
peaks. Since digestion by Rsal/Cfol and Rsal/Hhal produce the same restriction fragments but the former may be 
performed as a simultaneous digest since their buffer requirements are the same, this enzyme mixture was used for 
all subsequent genotyping by restriction digest protocols. 


Table V 


15 

Mass and Copy Number of Expected Restriction Digest Products 


Table Va f^fol nrn^iafinn 

i 


(+) (-) 

e2/e2 

e2/e2 

e2/e2 

e2/e2 

e2/e2 

e2/e2 


5781 , 5999 



1 


1 

2 

20 

10752, 10921 


1 

1 

2 

2 

2 


14645. 14858 


1 

1 

2 

2 

2 


22102, 22440 



1 


1 

2 

25 

26575, 25763 

2 

1 

1 





27849, 28436 

2 

2 

1 

2 

1 



Table Vb. Cfol/Rsal DJgc 

«tion^ 





(+) (-) 

e2/e2 

e2/e3 

e2/e4 

e3/e3 

e3/e4 

e4/e4 

30 

3428. 4025 


1 

1 

2 

2 

2 


5283, 5880 



1 


1 

2 


5761 , 5999 



1 


1 

2 

35 

11279, 11627 

2 

2 

1 

2 

1 



14845, 14858 


1 

1 

2 

2 

2 


18269, 18848 

2 

2 

1 





^fol Invariant fragment masses: 1846, 2177, 2186, 2435, 4924. 5004, 5412. 5750. 8871, 9628 Da, 
^Cfol/Rsaf Invariant fragment masses: 1B48, 2177. 2186. 2436, 4924. 5004. 5412, 5750, 6745, 7510, 8371 , 9628. 16240, 17175 Da. 


Table VI 



ddT Mr (Calc) 

ddT Mr (Exp) 

ddC Ms (Calc) 

ddC M, (Exp) 

e2/e2 

^5918, '=6768 


«6536, ^57387 


e2/e3 

^59 18, t>676e,b7965 

5919, 6769. 7967 

^536. '>6753, ^7387 

6542, 6752. 7393 

e2/64 

3591 8, t>6766, ^7965, ^8970 


^5903, ^6536, ^6753, ^7387 


e3/e3 

*5918, ^7965 

5918, 7966 

^6536, ^6753 

6542, 6756 

e3/e4 

^918,^7965, ^8970 

5914, 7959, 8965 

s5903,t'6536,*'6753 

5898, 6533, 6747 

64/e4 

^7965, ^89 70 

7966, 8969 

«5903, ^6753 

5900, 6752 


^From codon 112 detection primer (unextended 5629.7 Da). 


55 ''From ccxJon 1 58 detection primer (unexlended 6480.3 Da). 

Dashed lines: thte genotype not available from the analyzed pool of 100 pattenls. 

[0370] Figure 58a-c shows the ApoE G3/G3 genotype after digestion with CfoJ and a variety of prectpltation sch. 
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equal volume aliquots of the same amptified product were used for each. The sample treated with a single precipitation 
(Figure 58a) from an ammonium acetate/ethanol/glycogen solution results in a mass spectrum characterized by broad 
peaks, especially at high mass. The masses for intense peaks at 5.4, 10.7, and 14.9 kDa are 26 Da (0.5%). 61 Da 
(0.6%), and 45 Da (0.3%) Da higher, respectively, than the expected values; the resolution (the ratio of a peak width 
at half its total intensity to the measured mass of the peak) for each of these is -50, and decreases with Increasing 
mass. Such observations are consistent with a high level of nonvolatile cation adduction; for the 10.8 IcDa fragment, 
the observed mass shift is consistent with a greater than unit ratio of ad ducted; nonadducted molecular Ions. 
10371] MS peaks from a sample redissolved and precipitated a second time are far sharper (Figure 58b), with res- 
olution values nearly double those of the corresponding Figure 58a peaks, Mass accuracy values are also considerably 
improved; each is within 0.07% of Its respective calculated values, close to the independently determined Instrumental 
limits for DNA measurement using 3-HPA as a matrix. Single (not shown) and double (Figure 58C) precipitations with 
Isopropyl alcohol (IPA) instead of ethanol result In resolution and mass accuracy values comparable to those for cor- 
responding ethanol precipitations, but enhanced levels of dimerization are obsen/ed, again potentially confusing meas- 
urements when such dimers overlap with higher mass "diagnostics" monomers present in the solution. EtOH/ammo- 
nium acetate precipitation with gfycogen as a nucleation agent results in nearly quantitative recovery of fragments 
except for the 7-mers, serving as a simultaneous concentration and desalting step prior to MS detection. Precipitation 
from the same EtOH/ammonium acetate solutions in the absence of glycogen 'results in far poorer recovery, especially 
at low mass. 

[0372] The results indicate thatto obtain accurate (Mr(exp) values after either 1 PA and EtOH precipitations, a second 
precipitation is necessary to maintain high mass accuracy and resolution. 

[0373] The ratio of matrix:digest product also affects spectrai quality; severe suppression of higher mass fragments 
(not shown) observed with 1:1 volume matrix: digest product (redissolved in l^iL) is alleviated by using a 3 - 5 fold 
volume excess of matrix. 

[0374] Apo E genotyping by enzymatic digestion. Codon 11 2 and 1 58 polymorphisms fali within Cfol (but not Rsal) 
recognition sequences. In the 252 bp amplified product studied here, invariant (i.e. cut in all genotypes) sites cause 
cuts after bases 31 , 47, 138, 156, 239, and 246. The cutting site after base 66 is only present for e4, while that after 
base 204 is present in g3 and G4; the G2 genotype is cut at neither of these sites. These differences in the restriction 
pattern can be demonstrated as variations in mass spectra. Figure 59 shows mass spectra from several ApoE geno- 
types available from a pool of 1 00 patients (Braun, A etat., (1 992) Hum. Genet. 89:401 ^06V Vertical dashed lines are 
drawn through those masses corresponding to the expected Table V diagnostic fragments; other labeled fragments 
are invariant. Referring to Table V, note that a fragment is only considered "invariant" if it is present in duplicate copies 
for a given aileie; to satisfy this requirement, such a fragment must be generated in each of the E2m G3, and g4 alleles, 
[0375] Thespectrumin Figure 59a contains ail ofthe expected Invariant fragments above 3 kDa, as well as diagnostic 
peaks at 3428 and 4021 (both weak), 11276 and 11 627 (both intense), 14845, 18271 , and 18865 Da. The spectrum 
in Figure 59b is nearly identical except that the pair of peaks at 1 8 kDa is not detected, and the relative peak intensities, 
most notably among the 11-18 kDa fragments, are different. The spectrum In Figure 59c also has no 1 8 kDaf ragments' 
but instead has new low Intensity peaks between 5-6 kDa. The intensity ratios for fragments above 9 kDa are similar 
to those of Figure 59b except for a relatively lower 11 kDa fragment pair. Figure 59d, which again contains the 5-6 kDa 
cluster of peaks, is the only spectrum with no 11 kDa fragments, and like the previous two also has no 1 8 kDafragment. 
[0376] Despfte the myriad of peaks in each spectrum, each genotype can be identified by the presence and absence 
of only a few of the Table Vb diagnostic peaks. Due to the limited resolution of the MALDI-TOF instrumentation em- 
ployed, the most drfficult genotypes to differentiate are those based upon the presence or absence of the four diagnostic 
fragments between 5.2 and 6.0 kDa characteristic of the e4 allele, since these fragments nearly overlap with several 
invariant peaks. It has been found herein that the 5283 Da diagnostic fragment overlaps with a depurination peak from 
the 5412 Da invariant fragment, and the 5781 Da diagnostic peak fs normally not completely resolved from the 5750 
Da Invariant fragment. Thus, distinguishing between an E2/e4 and e2/G3, or between an E3/E4 and an e3/E3 allele, 
relies upon the presence or absence of the 5880 and 5999 Da fragments. Each of these is present in Figures 59c and 
59d, but not in 59a or 59b. 

[0377] The genotype of each of the patients in Figure 59 can be more rapidly Identified by reference to the flowchart 
In Figure 60. Consider the Figure 59a spectrum. The Intense pair of peaks at 11 kDa discounts the possibility of ho- 
mozygous G4, but does not differentiate between the other five genotypes. Likewise, the presence of the unresolved 
14.8 kDa fragments is inconsistent with homozygous G2, but leaves four possibilities (G2/G3, e2/G4, G3/G3, G3/G4), 
Of these only G2/E3 and E2/E4 are consistent with the 1 8 kDa peaks; the lack of peaks at 5283," 5879, 5779. and 
5998, Da indicate that the Figure 59a sample is E2/E3. Using the same procedure, the Figures 59b-d genotypes can 
be identified as E3/E3. E3/E4, and E4ye4, respectively. To date, all allele fdentifications by this method have been 
consistent with, and in many cases more easily interpreted than, those attained via conventional methods. The assign- 
ment can be further confirmed by assuring that fragment intensity ratios are consistent with the copy numbers of Table 
V. For instance, the 1 4.8 kDa fragments are of lower intensity than those at 1 6- 1 7 kDa In Figure 59a. but the opposite 
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is seen in Figures 59b-d. This is as expected, since in the iatter three genotypes the 14.8 kDa fragments are present 
in duplicate, but the first is a heterozygote containing e2, so that half of the amplified products do not contribute to the 
14.8 kDa signal, Likewise, comparison of the 11 kDafragnnent intensify to those at 9.6 and 14.8 kDa indicate that this 
fragment is double, double, single, and zero copy In Figures 59a, d, respectively. These data confirm that MALDI can 
perfonn in a semi-quantitative way under these conditions. 

[0378] ApoEgenotyping by Primer Otf go Base Extension (PROBE). The PROBE reaction was also tested as a means 
of simultaneous detection of the codon 1 12 and 1 58 polymorphisms. A detection primer is annealed to a single-stranded 
PCR>amplified template so that its 3' terminus is just downstream of the variable site. Extension of this primer by a 
DNA polymerase in the presence of three dNTPs and one ddXTP (that is not present as a dNTP) results in products 
whose length and mass depend upon the identity of the polymorphic base. Unlike standard Sanger type sequencing, 
in which a particular base-specific tube contains -99% dXTP and -1% ddXTP, the PROBE mixture contains 100% of 
a particular ddXTP combined with the other three dNTPs. Thus with PROBE a full stop of all detection primers is 
achieved after the first base complementary to the ddXTP is reached. 

£0379] Forthe G2/G3 genotype, the PROBE reaction (mixture of ddTTP. dATP, dCTP, dGTP) causes a Mr(exp) shift 
of the codon 1 1 2 primer to 591 9 Da, and of the codon 1 58 primer to 6769 and 7967 Da (Table VI); a pair of extension 
products results from the single codon 158 primer because the genotype is heterozygous at this position. Three 

extension products (one from codon 158, two from 112) are also observed from the heterozygote E3/G4 (Figure 61 c 
and Table VI), while only two products (one from each primer) are observed from the Figure 61b (G3/G3) and Figure 
59d (e4/E4) homozygote alleles. Refemng to Table VI, each of the available alleles result in all expected ddT reaction 
product masses within 0.1% of the theoretical mass, and thus each is unambiguously characterized by this data alone. 
Further configuration of the altete identities may be obtained by repeating the reaction with ddCTP (plus dATP, dTTP, 
dGTP); these results, summarized also in Table Vi, unambiguously confirm the ddT results. 

[0380] Appropriateness of the methods. Comparison of Figures 59 (restriction digestion) and 61 (PROBE) indicates 
that the PROBE method provides far more easily interpreted spectra forthe multiplex analysis of codon 112 and 158 
polymorphisms than does the restriction digest analysis. While the digests generate up to -25 peaks per mass spectrum 
and in some case diagnostic fragments overlapping with invariant fragments, the PROBE reaction generates a maxi- 
mum of only two peaks per detection primer (i.e. polymojphism). Automated peak detection, spectrum analysis, and 
allele identification would clearly be far more straightforward for the latter. Spectra for highly multiplexed PROBE, in 
which several polymorphic sites from the same or different amplified products are measured from one tube, are also 
potentially simple to analyze. Underscoring its flexibility, PROBE data analysis can be further simplified by Judicious a 
priori choice of primer lengths, which can be designed so that no primers or products can overlap in mass. 
[0361] Thus while PROBE is the method of choice for large scale clmical testing of previously well characterized 
polymorphic sites, the restriction digest analysis as described here Is ideally suited to screening for new mutations. 
The identity of each of the two polymorphisms discussed in this study affects the fragment pattern; if this is the only 
infomiatlon used, then the MS detection is a faster alternative to conventional electrophoretic separation of restriction 
fragment length polymorphism products. The exact measurement of fragment values can also give information on 
about sites completely remote from the enzyme recognition site since other singfe point mutations necessarily alter 
the mass of each of the single strands of the double stranded fragment containing the mutation. The 252 bp amplified 
product could also contain allelic variants resulting in, for example, previously described Glyl27 Asp (Weisgraber, KH 
et aL, (1984) J. CJin. Invest. 73:1024-1033), Argl36Ser (Wardell, MR et at, (1987) J. Clin. Invest. 80:483-490), 
Argl42Cys (Hone, Y et at., (1992) J. Biol. Chem. 267 :1962-1968). Arg145Cys (Rail SC Jr et aL, (1982) Proc. Natl. 
Acad. Sci. U.S.A. 79 :4696-4700). Lysl46Glu (IVIann. WA etaf., (1995) J. Clin. Invest. 96 :11 00-11 07). orLysl46Gln(Smit, 
M etai., (1 990) J. Lipid Res. 31 :45-53) substitutions. The G^A base substitution which codesforthe Glyl27 Asp amino 
acid substitution would result In a -1 6 Da shift in the sense strand, and In a + 1 5 Da (C->T) shift in the antisense strand, 
but not in a change in the restriction pattern. Such a minor change would be virtually invisible by electrophoresis; 
however, with accurate mass determination the substitution could be detected; the Invariant 56-mer fragment at 1 6240 
(sense) and 1 71 75 Da would shift to 1 6224 and 1 71 90 Da, respectively Obtaining the mass accuracy required to detect 
such minor mass shifts using current MALDI -TOP instrumentation, even with internal calibration, Is not routine since 
minor unresolved adducts and/or pooriy defined peaks limit the ability for accurate mass calMng, With high performance 
electrospray ionization Fourier transform (ESl-FTMS) single Da accuracy has been achieved with synthetic oligonu- 
cleotides (Little, DP efa/., (1995) Proc, Natl. Acad. Sci. U.S.A. 92 :2318-23221 upto 100-mers (Little, DP etai, (1994) 
J. Am. Chem. Soc. 116 :4893-4897). and similar results have recently been achieved wrth up to 25-mers using MALDl- 
FTMS (Li, Y etat., (1996) Anal. Chem. 68 :2090-2096). 
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EXAMPLE 13 

A Method for Ma«s Spectrometric Detection of DNA Fragments Associated With Telomerase Activity 

INTRODUCTION 

[0382] One-fourth of all deaths in the United States are due to malignant tumors (R.K. Jain, (1996) Science 271 : 
1079-1080). For diagnostic and therapeutic purposes there is a high interest in reliable and sensitive methods of tumor 
cell detection. 

[0383] Malignant cells can be distinguished from normal cells by different properties. One of those is the immortali- 
zation of malignant cells which enables uncontrolled cell-proliferation. Normal diploid mammalian cells undergo a finite 
number of population doublings in culture, before they undergo senescence. It is supposed that the number of popu- 
lation doublings in cutture, before they undergo senescence. It is supposed that the number of population doublings 
is related to the shortening of chromosome ends, called telomere^ in every cell division. The reason for said shortening 
is based on the properties of the conventional semlconservative replication machinery. DNA polymerases only work 
in 5' to 3* direction and need an RNA primer. 

[0384] Immortalization is thought to be associated with the expression of active telomerase. Said telomerase is a 
ribonucieoprotein catalyzing repetitive elongation of templates. This activity can be detected in a native protein extract 
of telomerase containing ceils by a special PCR-system (N.W. Kim et aL (1994) Science 266:2011-2015) known as 
telomeric repeat amplification protocol (TRAP). The assay, as used herein, is based on the telomerase specific exten- 
sion of a substrate primer (TS) and a subsequent amplification of the telomerase specific extension products by a PGR 
step using a second primer (bioCX) complementary to the repeat structure. The characteristic ladder fragments of 
those assays are conventionally detected by the use of gel electrop heretic and labeling or staining systems. These 
methods can be replaced by MALDI-TOF mass spectrometry leading to faster accurate and automated detection, 

MATERIALS AND METHODS 

Preparation of celis 

[0385] 1 X 1 cultured tetomerase-positive cells were pelleted, washed once with PBS (137 mM NaCl, 2.7 mM KCI, 
4.3 mM Na2HP04»7H£0, 1 .4 mM KH2PO4 in sterile DEPC water). The prepared cells may be stored at -75*0. Tissue 
samples have to be homogenized, according to procedures well known in the art, before extraction. 

Telomerase extraction 

[0386] Pellet was resuspended in 200 ^lI CHAPS lysis buffer (10 mM Tris-HCI pH 7.5, 1 mM MgCIa, 1 mM EGTA, 
0.1 mM benzamidine, 5 mM p-mercaptoethanol, 0.5% CHAPS, 10% glycerol) and incubated on ice for 30 min. The 
sanripie was centrifuged at 12,000 g for 30 min at 4'C. The supernatant was transferred into a fresh tube and stored 
at 75*C until use. 

TRAP-assay 

[0387] 2 ^ii of telomerase extract were added to a mixture of lOx TRAP buffer (200 mM Tris-HCI pH 8.3, 15 mM 
MgClg, 630 mM KCI, 0.05% Tween 20, 10 mM EGTA) 60x dNTP-mix (2.5 mM each dATP, dTTP, dGTP, and dCTP), 
10 pmol of TS primer and 50 pmol of bio CX primer in a final volume of 50 jil. The mixture was incubated at 30*C for 
10 minutes and 5 min. at 94*C, 2 units of Taq Polymerase were added and a PGR was performed with 30 cycles of 
94*C for 30 seconds, 50*0 for 30 seconds and 72''C for 45 seconds. 

Purification of TRAP-assay products 

[0386] For every TRAP-assay to be purified, 50 [li Streptavidin M-2e0 Dynabeads (10 mg/ml) were washed twice 
with 1x BW buffer (5 mW Tris-HCI, pH 7.5, 0.5 mM EDTA, 1 M NaCI). 50 ^1 of 2x BW buffer were added to the PCR 
mix and the beads were resuspended in this mixture. The beads were incubated under gentle shaking for 15 min. at 
ambient temperature. The supernatant was removed and the beads were washed twice with 1x BW buffer. To the 
beads 50 jil 25% ammonium hydroxide were added and incubated at 60"C for 10 min. The supernatant was saved, 
the procedure repeated, both supernatants were pooled and 300 \i\ ethanol (1 00%) were added. After 30 min. the DNA 
was pelleted at 13,000 rpm for 12 min., the pellet was air-dried and resuspended in 600 nl uitrapu re water. 
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MALDf-TOF MS of TRAP-assay products 

[0389] 300 nl sample were mixed with 500 ni of saturated matrix-solution (3-HPA:ammonium citrate = 10:1 molar 
ratio in 50% aqueous acetonitriie), dried at ambient temperature and introduced into the mass spectrometer (Vision 
2000, Finigan MAT). All spectra were collected in reflector mode using external calibration. 

Sequences and masses 

[0390] 

bioCX: d(bio-CCC TTA CCC TTA CCC TTA CCC TAA SEQ ID NO. 45), mass: 7540 Da. 

TS: d(AAT CCG TGC AGC AGA GTT SEQ ID N0.46), mass: 5523 Da. 

Tel omeric- repeat structure: (TTAGGG)^, mass of one repeat: 1909.2 Amplification products; 

TS elongated by three telomeric repeats (first amplification product): 12452 Da. (N3) 

TS elongated by four telomeric repeats: 14361 Da. (N4) 

TS elongated by seven teiomeric repeats: 20088 Da. (N7) 

RESULTS 

[0391] Figure 62 depicts a section of a TRAP-assay MALDI-TOF mass spectrum. Assigned are the primers TS and 
bioCX at 5497 and 7537 Da, respectively (calculated 5523 and 7540 Da). The signal marked by an asterisk represents 
n-1 primer product of chemical DNA synthesis. The first telomerase specific TRAP-assay product Is assigned at 12775 
Da. This product represents a 40-mer contaming three telomeric repeats. Due to primer sequences this is the first 
expected amplrfication product of a positive TRAP-assay. The product Is elongated by an additional nucleotide due to 
extendase activity of Taq DNA polymerase (calculated n on -extended product: 12452 Da, by A extended product: 12765 
Da). The signal at 6389 Da represents the doubly charged ion of this product (calculated: 6387 Da). Figure 63 shows 
a section of higher masses of the same spectrum as depicted in figure 62, therefore the signal at 1 2775 Da is identical 
to that in figure 62. The TRAP-assay product containing seven telomeric repeats, representing a 64- mer also elongated 
by an additional nucleotide, is detected at 20322 Da (calculated: 20395 Da). The signals marked 1 , 2, 3 and 4 cannot 
be base-tine resolved. This region includes of: 1. signal of dimeric n-1 primer, 2. second TRAP-assay amplification 
product, containing 4 telemeric repeats and therefore representing a46-mer (calculated: 14341 Da/14674 Da for ex- 
tendase elongated product) and 3. dimeric primer-ion and furthennore all their corresponding depurinatlon signals. 
There is a gap observed between the signals of the second and fifth extension product. This signal gap corresponds 
to the reduced band intensities observed in some cases for the third and fourth extension product in autoradiographic 
analysis of TRAP-assays (N.W. Kim et al. (1 994) Science 266:201 3). 

[0392] The above-mentioned problems, caused by the dimeric primer and related signals, can be overcome using 
an ultrafiltration step employing a molecularwelght cut-off membrane for primer removal prior to MALDI-TOF-MS anal- 
ysis. This will permit an unambiguous assignment of the second amplification product. 

EXAMPLE 14 

A method for Detecting Neuroblastoma-Speclfic Nested RTT-ampiifled products Via MALDI-TOF Mass 
Spectrometry 

Introduction 

[0393] Neuroblastoma is predominantly a tumor of early childhood with 66% of the cases presenting in children 
younger than 5 years of age. The most common symptoms are those due to tumor mass, bone pain, or those caused 
by excessive catecholamine secretion. In rare cases, neuroblastoma can be identified prenatatly (R.W, Jennings et al, 
(1 993) J. Ped. Surgery 28:11 68-11 74). Approximately 70% of all patients with neuroblastoma have metastatic disease 
at diagnosis. The prognosis is dependent on age at diagnosis, clinical stage and other parameters. 
[0394] For diagrrostic purposes there is a high interest in reliable and sensitive methods of tumor cell detection, e. 
c^^, in control of autologous bone marrow transplants or on-going therapy. 

[0395] Since catecholamine synthesis is a characteristic property of neuroblastoma cells and bone marrow cells lack 
this activity (H. Naito et al. , (1991) Eur. J. Cancej^ 27: 762-765), neuroblastoma cells or metastasis in bone marrow can 
be identified by detection of human tyrosine 3-hydroxylase (E.G. 1.14.16.2, hTH) which catalyzes the first step in 
biosynthesis of catecholamines. 

[0396] The expression of hTH can be detected via reverse transcrrption (RT) polymerase chain reaction (PGR) and 
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the amplified product can be analyzed via MALDI-TOF mass spectrometry. 

Materials and methods 

5 Ceff- or tissue-treatment 

[0397] Cultures cells were pelleted (10mln. 8000 rpm) and washed twice with PBS (137 mM NaCI,2.7mM KCI, 4.3 
mM fvJa2HP04»7H20, 1 A mM KH2PO4 in sterile PEPC water). The pellet was resuspended in 1 ml lysis/bfnding buffer 
(1 00 mM Tris-HCI, pH 8.0, 500 mM LiCI, 1 0 mM EDTA, 1 % Li-dodecyl sulfate, 5 mM DTT) until the solution becomes 
10 viscose. Viscosity was reduced by DNA-shearstep using a 1 misyringe. The lysate may be stored in -75*C orprocessed 
further directly. Solid tissues ( e.g. , patient samples) have to be homogenized before lysis. 

Preparation of magrietic Otigo-dT(25) beads 

f5 [0398] 100 jO-L beads per 1x10® cells were separated from the storage buffer and washed twice with 200 |xL lysis/ 
binding buffer. 

Isolatiof} of poly A*- RNA 

20 [0399] The cell lysate was added to the prepared beads and incubated for5 min. at ambient temperature. The beads 
were separated magnetically for 2-5 min. and washed twice with 0.5 ml LDS {10 mM Tris-HCI, pH S.O, 0.15 M LiCl, 1 
mM EDTA, 0.1% LIDS). 

Solid-phase first-strand cDNA synthesis 

25 

[0400] The poly A+RN A containing beads were resuspended in 20 ^iL of reverse transcription mix (50 mM Tris-HCI, 
pH S.3, 8 mM MgClg, 30 mM KCt, 10 mM DTT, 1 .7 mM dNTPs, 3 U AMV reverse transcriptase) and incubated for 1 
hour at 45*C (with a resuspension step all ten min.). The beads were separated from the reverse transcription mix, 
resuspended in 50 \iL of elution buffer (2 mM EDTA pH 6.0) and heated to 95'C for 1 min. fur elution of the RNA. The 
30 beads with the cDNA first-strand can be stored in TB (0.089 M Tris-base, 0.089 M boric acid, 0.2 mM EDTA pH 8.0), 
TE 10 mM Tris-HCI, 0.1 mM EDTA, pH 6.0) or 70% ethanol for further processing. 

Nested polymerase chain reaction 

35 [0401] Beads containing cDNA first-strand were washed twice with 1x PGR buffer (20 mM Tris-HCI pH 8.75, lOmM 
KCl, 10 mM (NHJ2SO4, 2 mM MgS04, 0.1% Triton X-100, 0.1 mg bovine serum albumin) and resuspended in PGR 
mix (containing 100) pmol of each outer primer, 2.5 u Pfu (exo-) DNA polymerase, 200 |^M of each dNTP and PCR 
buffer in a final volume of 50 i^L). The mixture was incubated at 72'*C 1 mm. and amplified by PCR for 30 cycles, for 
the nested reaction: 1 \iL of the first PCR was added as template to a PCR mix d(as above but nested primers instead 

40 of outer primers) and subjected to the following temperature program: 94'C 1 min,, 65*C 1 min, and 72* C 1 min. for 
20 cycles. 

Purification of nested amplified products 

45 [0402] Primers and iow-molecular reaction by-products are removed using 10,000 Da cut-off uitrafittration-unit. Ul- 
trafiitration was perfomned at 7,500 g for 25 minutes. For every PGR to be purified, BOjiL Streptavidin M-2B0 Dynabeads 
(10 mg/ml) were washed twice with ixBW buffer (5 mM Tris-HCI, pH 7.5, 0.5 mM EDTA, 1 M NaCI), added to the 
ultrafiltration membrane and incubated under gentle shaking for 15 min. at ambient temperature. The supernatant was 
removed and the beads were washed twice with IxBW buffer 50 |xL 25% ammonium hydroxide were added to the 

so beads and incubated at ambient temperature for 10 min. The supernatant was saved, the procedure repeated, both 
supernatants were pooled andSOOfiL ethanoi (100%) were added. After 30 min. the DNA was pelleted at 13,000 rpm 
for 12 min., the pellet was air-dried and resuspended in 600 nl ultrapure water. 

MALDt'TOF MS of nested amptified products 

55 

[0403] 300 nl sample was mixed with 500 nl of saturated matrix-solution (3-HPA: ammonium citrate = 10:1 moiar 
ratio in 50% aqueous acetonitriie), dried at ambient temperature and introduced into the mass spectrometer (Vision 
2000, FInigan MAT). All spectra were collected in reflector mode using external calibration. 
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Outer primers: 
[0404] 

hTHI : d{TGT CAG AGO TGG ACA AGT GT SEQ ID NO:47) 

hTH2: d(GAT ATT GTC TTC CCG GTA GO SEQ ID NO:48) Nested primers; 

bio-hTH d(bio-CTC GGA CCA GGT GTA CCG CC SEQ ID NO:49), nnass:6485 Da 

hTH6; d(CCT GTA CTG GAA GGC GAT CTC SEQ ID NO:50), mass:6422 21 Da 

mass of biotlnylated single strand amplified product: 19253:6 Da 

mass of nonblotinylated single strand amplified product: 18758.2 Da 

Results 

[0405] A MALDI-TOF mass spectrum of a human tyrosine 3-hydroxylase (hTH) specific nested amplified product 
(61 -mer) is depicted in figure 64. The signal at 1 8763 Da corresponds to n on -biotlnylated strand of the amplified product 
(calculated: 1875B.2 Da, mass error: 0.02 Da). The signals below 10,000 and above 35,000 Da are due to multiply 
charged and cfimeric amplified product-ions, respectively. 

[0406] The product was obtained from a solid phase cDNA derived in a reverse transcription reaction from 1x10^ 
cells of a neuroblastoma cell-line (L-A-N-1) as described above. The cDNA first-strand was subjected to a first PGR 
using outer primers (hTH1 and hTH2), an aliquot of this PGR was used as template in a second PGR using nested 
primers (bfohTH and hTH 6). The nested ampiified product was purified and MALDI-TOF MS analyzed: 
[0407] Tfie spectrum In Fig. 64 demonstrates the possibility of neuroblastoma cell detection using nested RT-PGR 
and MALDI-TOF MS analysis. 

EXAMPLE 15 

Rapid Detection of the RET Proto-oncogene Codon 634 Mutation Using Mass Spectrometry 

Material and Methods 
Probe 

[0408] The identity of codon 634 in each of the three alleles was confirmed by Rsai enzymatic digestion, single strand 
conformational polymorphism or Sanger sequencing. Exon 11 of the RET gene was PGR amplified (40 cycles) from 
genomic DNA using Taq-Polymerase (Boehringer-Mannhelm) with 8 pmol each of 5'-biotinylated forward (5"-biotin- 
CAT GAG GGA GAG CAT ACG CA-3' SEQ ID NO:51) and unmodified reverse (5'-GAC AGO AGO ACG GAG AGG 
AT-3' SEQ ID NO:52) primer per tube; amplified products were purified using the Qiagen (QIAquIck" kit to remove 
unincorporated primers. 15 |il of amplified product were immobilized on 10 ^lL (10 mg/mL) Dynal streptavidin coated 
magnetic beads, denatured using the manufacturer's protocol, and the supernatant containing antisense strand dis- 
carded, the PROBE reaction was performed using thermoSequenase (TS) DNA Polymerase (Amersham) and Phar- 
macia dNTP/ddNTPs. B pmol of extension primer (5'-CGG CTG CGA TCA CCG TGC GG-3' SEQ ID NO;53) was added 
to 13 M,L H^O, 2^L TS-buffer, 2)iL 2mM ddATP (or ddTTP), and 2^iL of 0.5 mM dGTP/dCTP/dTTP (or dGTP/DCTP/ 
dATP), and the mixture heated for 30 sec @ 94*C, foliowed by 30 cycles of 10 sec @ 94'C and 46 sec @ 50*C; after 
a 5 min. incubation @ 95'C, the supernatant was decanted, and products were desalted by ethanol precipitation with 
the addition of 0.5 \iL of 1 0mg/mL glycogen. The resulting pefiet was washed in 70% ethanol, air dried, and suspended 
in 1 |xL HgO. 300 nL of this was mixed with the MALDI matrix (0.7 M 3-hydroxyplcolmic acid, 0.07 M ammonium citrate 
in 1 :1 H20:CH3CN) on a stainless steel sample probe and air dried. Mass, spectra were collected on a Thermo Bio- 
nalysis Vision 2000 MALDi-TOF operated In reflectron mode with 5 and 20 kV on the target and conversion dynode, 
respectively. Experimental masses {m,(exp)) reported are those of the neutral molecules as measured using externai 
calibration. 

Direct MeasuremBnt of Diagriostic Products 

[0409] PGR amplifications conditions for a 44 bp region containing codon 634 were the same as above but using 
Pfu polymerase; the forward primer contained a ribonucleotide at its 3'-tenninus (forward, 5'-GAT CCA CTG TGC GAG 
GAG C {SEQ ID NO:54) -ribo; reverse, 5^-GCG GCTGCG ATC ACG GTG C (SEQ ID NO:55). After product immobi- 
lization and washing, 80 p.L of 1 2.5% NH4OH was added and heated at 80*C overnight to cleave te primer from 44-mer 
(sense strand) to give a 25'mer Supernatant was pipetted off while still hot, dried resuspendod In 50 |xL HgO, precip- 
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itated, resuspGnded, and measured by MALDI-TOF as above, MALDt-FTMS spectra of 25-mer synthetic analogs were 
collected as previously described (Li, Y, et aL. (1996) AnaL Chem. 68:2090-2096); briefly, 1-10 pmol DNA was mixed 
1:1 with matrix on a direct insertion probe, admitted into the external ion source (positive ion mode), ionized upon 
irradiance with a 337 nm waveiength laser puise, and transferred via rf-only quadruple rods into a 6.5 Tesia magnetic 
5 field where they were trapped collisionally. After a 1 5 second delay, ions were excited by a broadband chirp pulse and 
detected using 256K data points, resulting in time domain signals of 5 s duration. Reported (neutral) masses are those 
of the most abundant isotope peak after subtracting the mass of the charge carrying proton (1 .01 Da), 

Results 

w 

[041 0] The first scheme presented utilizes the PROBE reaction shown schematically in Figure 65. A 20-mer primer 
is designed to bind specifically to a region on the complementary template downstream of the mutation site; upon 
annealing to the template, which is labelled with biotin and immobilized to streptavidin coated magnetic beads, the 
PROBE primer is presented with a mixture of the three deoxynucleotide triphosphates (dNTPs), a di-dNTP (ddNTP), 
^5 and a DNA polymerase (Figure 65). The primer Is extended by a series of bases specific to the identity of the variable 
base in codon 634; for any reaction mixture (e.g. , ddA+dT+dC+dG), three possible extension products representing 
the three alleles are possible (Figure 65). 

[0411] For the negative control (Figure 66), the PROBE reaction with ddATP+dNTPs (N ^T, C, G) causes a Mr(exp) 
shift of the primer from 61 35 to 6726 Da (Am + 591 ). The absence of a peak at 6432 rules out a C^A mutation (Figure 

so 65); the mass of the single observed peak is more consistent with extension by C-ddA (Mr(calc) 6721 , +0.07% error) 
than by T-ddA (Mr(calc) 6736, -0.15% error) than of A3TC2G expected for C->A mutant. Combining the ddA and ddT 
reaction data, It is clear that the negative control is as expected homozygous nomnal at codon 634, 
[0412] The ddA reaction for patient 1 also results in a single peak (Mr(exp) = 6731) between expected values for 
wildtype and C->T mutation (Figure 65b). The ddT reaction, however, results In two clearly resolved peaks consistent 

25 with a heterozygote wildtype (Mr(exp) 8249, +0.04% mass error)/C->T mutant (Mr(exp) 6428 Da, +0.08% mass en-or). 
For patient 2, the pair of Figure 66c ddA products represent a heterozygote C->A (Mr(exp) 6431 , -0.06% mass error)/ 
nomnal (Mr(exp) 671 9, -0.03% mass error) allele. The ddT reaction confimns this, with a single peak measured at 8264 
Da consistent with unresolved wildtype and C-*A alleles. The value of duplicate experiments Is seen by comparing 
Figures 66a and 66b; while for patient 1 the peak at 6726 from the ddA reaction represents only one species, similar 

30 peak from patient 1 is actually a pair of unresolved peaks differing in mass by 1 5 Da. 

[0413] An alternate scheme for point mutation detection is differentiation of alleles by direct measurement of diag- 
nostic product masses. A 44-mer containing the RET634 site was generated by the PGR, and the 1 9-mer sense primer 
removed by NH4OH cleavage at a ribonucleotide at its 3' terminus. 

[0414] Figure 67 shows a series of MALDhFTMS spectra of synthetic analogs of short amplified products containing 
35 the RET634 mutant site. Figures 67a-c and 67d-f are homozygous and heterozygous genotypesj respectively. An 
internal calibration was done using the most abundant isotope peak for the wildtype allele; application of this (external) 
calibration to the five other spectra resulted in better than 20 ppm mass accuracy for each. Differentiation by mass 
aione of the alleles is straightforward, even for heterozygote mixtures whose components differ by 1 6.00 (Figure 67d), 
2501 (Figure 67e), or 9.01 Da (Figure 65f). The value of high performance MS Is clear when recognition of small DNA 
4£> mass shifts is the basis for diagnosis of the presence or absence of a mutation. The recent reintroduction of delayed 
extraction (DE) techniques has improved the perfomiance of MALDI-TOFwith shorts DNAs (Roskey, M.T. etaL, (1996) 
Anal. Chem. 68:941-946); a resolving power (RP) of>103 has been reported for a mixed-base 50-mer, and a pair of 
31 -mere with a C or a T (Am IS Da) at a variable position resolved nearly to baseline. Thus DE-TOF-MS has demon- 
strated the RP required for separation of the individual components of heterozygotes. Even with DE, however, the 
45 precision of DNA mass measurement with TOF is typically 0.1 % (8 Da at 8 kDa) using external calibration, sufficiently 
high to result in incorrect diagnoses. Despite the possibility of space charge induced frequency shifts (Marshall, A.G. 
etai. (1991) Anal. Chem. 63:21 5A-229A), MALDI-FTMS mass errors are rarely as high as 0.005% (0.4 Da at 8 KDa), 
making internal calibration unnecessary. 

[0415] The methods for DNA point mutation presented here are not only applicable to the analysis of single base 
^0 mutations, but also to less demanding detection of single or multiple base insertions or deletions, and quantification 
of tandem two, three ^ or four base repeats. The PROBE reaction yields products amenable to analysis by relatively 
low performance ESI or MALDI instrumentation; direct measurement of short amplified product masses is an even 
more direct means of mutation detection.and will likely become more widespread with the Increasing Interest in high 
performance MS available with FTMS. 
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EXAMPLE 16 

Immobilization of nucleic acids on solid supports via an acid-labile covalent bifunctlonal trityl linker 

^ [0416] Aminolinked DNA was prepared and purified according to standard methods. A portion (10eq) was evaporated 
to dryness on a speedvac and suspended in anhydrous DMF/pyridine (9:1 ; 0.1 ml). To this was added the chlorotrityl 
chloride resin (1 eq, 1 ,05iimol/mg ioading) and the nnixture was shaken for 24 hours. The loading was checked by 
taking a sample of the resin, detritylating this using 80% AcOH, and measuring the absorbance at 260nm. Loading 
was ca. 150pmol/mg resin. 

10 [0417] In 80% acetic acid, the half-life of cleavage was found to be substantially less than 5 minutes-this compares 
with trityl ether-based approaches of half-lives of 1 05 and 39 minutes for para and mefa substituted bifunctional dimeth- 
oxytrityl linkers respectively. Prelimfnary results have also indicated that the hydroxy picolfnic acid matrix alone is 
sufficient to cleave the DNA from the chlorotrityl resin. 

15 EXAMPLE 17 

Immobilization of nucleic acids on solid supports via hydrophobic trityl linker 

[0418] The primer contained a 5'-dimeth oxytrityl group attached using routine trityl-on DNA synthesis. 
20 [041 9] CI8 beads from an oligo purification cartridge (0.2 mg) placed in a filter tip was washed with acetonitrile, then 

the solution of DNA (50 ng in 25 ^1) was flushed through. This was then washed with 5% acetonitrile in ammonium 

citrate buffer (70 mM, 250 p.1). To remove the DNA form the CIS, the beads were washed with 40% acetonitrile in water 

(1 0^\) and concentrated to ca 2 ^li on the Speedvac. The sample was then submitted to MALDl. 

[0420] The results showed that acetonitrile/water at levels of ca.>30% are enough to dissociate the hydrophobic 
25 interaction. Since the matrix used in MALDl contains 50% acetonitrile, the DNA can be released from the support and 

successfully detected using MALD)-TOF MS (with the trityl group removed during the MALDl process). 

[0421] Figure 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers. 

EXAMPLE 18 

30 

Immobilization of nucleic acids on solid supports via Streptavidin-lmlnobiotin 

Experimental Procedure 

35 [0422] 2-iminobiotin N- hydroxy -sued nimid ester (Sigma) was conjugated to the oligonucleotides with a 3'- or5-'amino 
linker following the conditions suggested by the manufacturer The completion of the reaction was confirmed by MALDI- 
TOF MS analysis and the product was purified by reverse phase HPLC. 

[0423] For each reaction, 0.1 mg of streptavldin -coated magnetic beads (Dynabeads M-280 Streptavidin from Dynal) 
were incubated with 80 pmol of the corresponding oligo in the presence of 1 M NaCI and 50 mM ammonium carbonate 

40 (pH 9.5) at room temperature for one hour. The beads bound with oligonucleotides were washed twice with 50 mM 
ammonium carbonate (pH 9.5). Then the beads were incubated in 2|xl of 3-HPA matrix at room temperature for 2 min. 
An aliquot of 0.5 ^lI of supernatant was applied to MALDI-TOF. For biotin displacement experiment, 1 .6. mol of free 
biotin (80-fold excess to the bound oligo) in 1 ^1 of 50 mM ammonium citrate was added to the beads. After a 5 min. 
Incubation at room temperature, 1 |il of 3-HPA matrix was added and 0.5 \l\ of supernatant was applied to MALDI-TOF 

^5 MS. To maximize the recovery of the bound iminobiotin oligo, the beads from the above trea^ent were again incubated 
with a 2 M.I of 3'HPA matrix and 0,5 jil of supernatant was applied to MALDI-TOF MS. The matrix alone and free biotin 
treatment quantitatively released iminobiotin oligo off the streptavidin beads as shown in Figures 70 and 71 . 

EXAMPLE 19 

so 

Mutation Analysis UsJng Loop Primer Oligo Base Extension 
MATERIALS AND METHODS 

55 [0424] Genomic DNA. Genomic DNA was obtained from healthy individuals and patients suffering from sickle cell 
anemia. The wildtype and mutated sequences have been evaluated conventionally by standard Sanger sequencing. 
[0425] PCR-Amptiftcation. PGR ampltfications of a part of the p-globin was established and optimized to use the 
reaction product without a further purification step for capturing with streptavidin coated bead. The target amplification 
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for LOOP-PROBE reactions were performed with the ioop-cod5 d{GAG TCA GGT GCG CCA TGC CTC AAA CAG 
ACA CCA TGG CGC, SEQ iD No. 58) as forward primer and p-11-bio d(TCT CTG TCT CCA CAT GCG CAG, SEQ 
iD. No. 59) as biotinyiated reverse primer. The underiined nucieotide in the loop-cod5 primer is mutated to introduce 
an invariant Cfol restriction site into the amplicon and the nucleotides in italics are complementary to a part of the 
amplified product. The total PCR volume was 50^l1 including 200 ng genomic DNA, 1U Taq-polym erase (Boehringer- 
Marnheim, Cat# 1596594), 1.5 mM MgCJg, 0,2 mM dNTPs (Boehringer-Mannheim, Ca# 1277049), and 10 pmol of 
each primer, A specific fragment of the p-globin gene was amplified using the following cycling condition: 5 min 94"C 
followed by 40 cycles of : 30 sec @ 94'C. 30 sec @ 56'C, 30 sec @ 72*C, and a final extension of 2 min at 72'C. 
[0426] Capturing and ctenatumtion of biotinytated templates. 10|il paramagnetic beads coated with streptavidin 
(10mg/ml; Dynal, Dynabeads M-280 streptavidin Cat# 112.06) and treated with 5x binding solution (5 M NH4CI, 0.3M 
NH4OH) were added to 40 [i\ PCR volume (1 0\x.\ of the amplified product was saved for check electrophoresis). After 
incubation for 30 min at 37*C the supernatant was discarded. The captured templates were denatured with 50 ^1 1 00 
mM NaOH for 5 min at ambient temperature, then washed once with 50 yi\ 50 mM NH4OH and three times witii 1 00 ^il 
1 0mM Tris.CI, pH 8.0. The single stranded DNA served as templates for PROBE reactions. 

[0427] Primer oUgo base extension (PROBE) reaction. The PROBE reactions were performed using Sequenase 2.0 
(USB Cat# E70775Z Including buffer) as enzyme and dNTPs and ddNTPs supplied by Boehringer-Mannheim (Cat# 
1277049 and 100B3B2). The ratio between dNTPs (dCTP, dGTP, dTTP) and ddATP was 1:1 and the total used con- 
centration was 50 |j,M of each nucleotide. After addition of 5|il 1-fold Sequenase-buffer the beads were incubated for 
6 min at 65**C and for 1 0 min at 37'C. During this time the partially self complementary primer annealed with the target 
site. The enzymatic reaction started after addition of 0.5 jj,l 100 mM dithiothreitol (DTT), 3.5 |il dNTP/ddNTP solution, 
and 0.5 \i\ Sequenase (0.8 U) and incubated at 37"C for 1 0 min. Hereafter, the beads were washed once In 1 -fold TE 
buffer (1 0 mM Tris, 1 mM EDTA, pH 8.0). 

[0428] Cfol restriction digest The restriction enzyme digest was perfonned in a total volume of 5^Ll using 10 U Cfol 
in 1-fold buffer L purchased from Boehringer-Wannheim. The incubation time was 20 min at 37*C. 

Conditioning of tiie diagnostic products for mass spectrometric analysis 

[0429] After the restriction digest, the supernatant was precipitated in 45 ^1 H2O, 10 |xl 3M NH4- acetate (pH 6.5), 
0.5 1^1 glycogen (10 mg/ml in water, Sigma, Cat# G1765). and 110^.1 absofute ethanol for 1 hour at room temperature. 
After centrifugation at 13,000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 2 i^MS Mohm/ 
cm H2O. The beads were washed in 1 00 jj^i 0.7 M HH^ citrate followed by 1 00 \i\ 0,05 M NH^ citrate. The diagnostic 
products were obtained by heating the beads in 2 \i\ 50 mM NH4OH at SO'^C for 2 min. 

Sample preparation and anatysis on I^ALDf-TOF mass spectrometry. 

[0430] Same preparation was performed by mixing 0.6 \\\ of matrix solution (0.7 M 3-hydroxypicoiinic acid, 0.07 M 
dibasic ammonium citrate in 1 :1 HaO^CHaCN) with 0.3 |il of either resuspended DNA/glycogen pellet or supernatant 
after heatfng the beads in 50 mM NH4OH on a sample target and allowed to air dry. The sample target was automatically 
introduced in to the source region of an unmodified Perspective Voyager MALDI-TOF operated in delayed extraction 
linear mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical molecular mass (Mr(calc)) 
were calculated from atomic compositions; reported experimental (Mr(exp)) values are those of the singly-protonated 
form. 

RESULTS 

[0431] The LOOP-PROBE has been applied to the detection of the most common mutation of codon 6 of the human 
p-globin gene leading to sickle cell anemia. The single steps of the method are schematically presented in figure 72. 
For the analysis of codon 6, a part of the p-globin gene was amplified by PCR using the biotinyiated reverse primer 
piibio and the primer loop-codS which is modified to introduce a Cfol recognition site (fig. 72a). The amplified product 
is 192 bp in length. After PCR the amptiflcation product was bound to streptavidin coated paramagnetic particles as 
described above. The antisense strand was isolated by denaturation of the double stranded amplified product (Fig. 
72b). The intra-molecule annealing of the complementary 3* end was accomplished by a short heat denaturation step 
and incubation at 37*C, The 3' end of the antisense strand is now partially double stranded (Fig. 72c). For analyzing 
the DNA downstream of the self annealed 3'-end of the antisense strand, the primer oligo base extension (PROBE) 
has been performed using ddATP, dCTP, dGTP, dTTP (Fig. 72d). This generates different products in length specific 
for the genotype of the analyzed individual. Before the determination of the length of these diagnostic products, the 
DNA was incubated with the Cfol restriction endonuclease that cuts 5' of the extended product. This step frees the 
stem loop from the template DNA whereas the extended product still keeps attached to the template. The extended 
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products are then denatured by heatirtg from the template stand and analyzed by MALDI-TOF mass spectrometry. 
[0432] Si nee the MALDI-TOF analyses were performed with a non-calibrated instrument, the mass deviation between 
observed and expected values was approximately 0.6% higher than theoretically calculated. Nevertheless, the results 
obtained were conclusive and reproducible within repeated experiments. In all analyzed supernatants after the restric- 
tion digest the stem loop could be detected. Independent of the genotype, the stem loop has had in all analyses mo™ 
lecular masses about 81 50 Da (expected 8111 Da). An example is shown in Figure 73a. The second peak in this figure 
with a mass of 4076 Da Is a doubly charged ion of the stem loop. Figure 73b to 73d show the analyses of different 
genotypes as indicated In the respective inserts. HbA is the wildtype genotype and HbC and HbS are two different 
mutations in codon 6 of the p-gJobin gene which cause sickle cell disease. In the wildtype situation a single peak with 
a molecular mass of 4247 Da and another with 6696 Da are detected (Fig, 73b). The latter corresponds to the bloti- 
nylated PGR primer (p-11-blo) unused in the PGR reaction which also has been removed in some experiments. The 
former corresponds to the diagnostic product for HbA, The analyses of the two individual DNA molecules with HbS 
trait as welf as compound heterozygosity (HbS/HbC) for the sickle cell disorder lead also to unambiguous expected 
results (Fig. 73c and 73d). 

[0433] In conclusion, the LOOP-PROBE is a powerful means for detection of mutations espec(ally predominant dis- 
ease causing mutations or common polymorphisms. The technique eliminates one specific reagent for mutation de- 
tection and, therefore, simplifies the process and makes it more amenable to automation. The specific extended product 
that is analyzed is cleaved off from the primer and is therefore shorter compared to the conventional method, In addition, 
the annealing efficiency is higher compared to annealing of an added primer and should therefore generate more 
product. The process is compatible with multiplexing and various detection schemes (e.g. , single base extension, oligo 
base extension and sequencing). For example, the extension of the loop-primer can be used for generation of short 
diagnostic sequencing tadders within highfy polymorphic regions to perform, for example, HLA typing or resistance as 
well as species typing ( e.g. , Mycobacterium tuberculosis)). 

EXAMPLE 20 

T7-RNA Polymerase Dependent Amplification of CKR-5 and Detection by MALDI-MS 
MATERIALS AND METHODS 

[0434] Genomic DNA, Human genomic DNA was obtained from heaJthy individuals. 

[0435] PCR-Ampfification and Punficaiion. PGR amplification of a part of the CKR-5 gene was accomplished using 
cl<rT7f as sense primer d(ACC TAG CGT TCA GTT CGA CTG AGA TAA TAC GAC TCA CTA TAG GAG CTC TCA 
I I \ TCC ATA C (SEQ ID NO. 60). The underlined sequence corresponds to the sequence homologous to CKR-5. the 
bolded sequence corresponds to the T7-RNA polymerase promoter sequence and the italic sequence was chosen 
randomly. cItrSr was used as antlsense primer d(AAC TAA GCC ATG TGC AGA AGA (SEQ ID NO, 61 ), Purification of 
the amplified product and removal of unincorporated nucleotides was carried out using the QIAquick purification l<it 
(Qiagen, cat# 28104), In the final PGR volume of 50 ^li were 200 ng genomic DNA, 1 U Taq -polymerase (Boehringer- 
Mannheim, cat# 1 596594), 1 .5 mM MgCI^ 0.2 mM dNTPs (Boehringer-Mannheim, cat# 1277049), and 1 0 pmof of each 
primer. The specific fragment of the CKR-5 gene was amplified using the following cycling conditions: 5 min @ 94**C 
followed by 40 cycles of 45 sec @ 45 sec 52'C, 5 sec @ 72'C, and a final extension of 5 min at 72*'C. 

[0436] T7-RNA PofymerBse conditions. One third of the purified DNA (about 60ng) was used in the T7-RN A poiymer- 
ase reaction. (Boehringer-Mannheim, cat# 881 767). The reaction was carried out for 2h at 37'C according to the 
manufacturer's conditions using the included buffer The final reaction volume was 20 ^1 0,7 |il RNasin (33 U/^ii) had 
been added. After the extension reaction, the enzyme was inactivated by incubation for 5 min at 65*G. 

DNA digestion and conditioning of the diagnostic products for mass spec anaiysis. 

[0437] The template DNA was digested by adding RNase-free DNase f (Boehringer-IVIannheimn, cat# 776 758) to 
the inactivated T7 mixture and Incubation for 20 min at room temperature. Precipitation was carried out by adding 1 
1^1 glycogen (10 mg/ml, Sigma, cat# G1765), 1/10 volume 3M NHg. acetate (pH 6.5), and 3 volume absolute ethanol 
and incubation for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min, the pellet was washed in 
70% ethanol and resuspended in 3 |xl 18 IVIohm/cm H2O. 1 [i\ was analyzed on an agarose gel, 

Sample preparation and analysis on MALDI-TOF mass spectrometry 

[043q Sample preparation was performed by mixing 0.6 [l\ of matrix solution (0,7 M 3-hydroxypicolinic acid, 0.07 IVI 
dibasic amnnoniunn citrate in 1 :1 HjOiCHsCN) with 0.3 |jil of resuspended DNA/glycogen on a sampletarget and allowed 
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to air dry. The sampJe target was introduced into tine source region of an unmodified Finnigan VISJOiM2000 MALDl- 
TOF operated in reiectron mode with 5I<V. The tlieoreticai molecular mass was calculated form atomic composition; 
reported experimental values are those of singfy-pronated form, 

RESULTS 

[0439] The chemol<ine receptor CKR-5 has been identified as a major coreceptor in HIV-1 (see e.g,, WO 36/39437 
to Human Genome Sciences; Cohen^ J. et ai. Science 275:1261). A mutant allele that is characterized by a 32 bp 
deletion is found In 16% of the HIV-1 seronegative population whereas the frequency of this allele is 35% lower in the 
HIV-1 seropositive population. It is assumed that individuals homozygous for this ailele are resistant to HIV-1. The 
T7-RNA polymerase dependent amplification was applied to identify this specific region of the chemokine receptor 
CKR-5 {Figure 74). Human genomic DNA was amplified using conventional PGR. The sense primer has been modified 
so that it contains a random sequence of 24 bases that facilitate polymerase binding and the T7-RNA polymerase 
promoter sequence (Figure 75). The putative start of transcription is at the first base 5' of the promoter sequence. ckr5r 
was used as an anlisense primer. PGR conditions are outlined above. The amplified product derived from wlldtype 
alleles is 75 bp in length. Primer and nucleotides were separated from the amplification product using the QIagen 
QIAquIck purification kit. One third of the purified product was applied to in vitro transcription with T7-RNA polymerase. 
To circumvent Interference of the template DNA, it was digested by adding RNase-free DNase I. RNA was precipitated 
and this step also leaves the degraded DNA in the supernatant. Part of the redissolved RNA was analyzed on an 
agarose gel and the rest of the sample was prepared for MALDI-TOF analysis, The expected calculated mass of the 
product is 24560 Da. A dominant peak, that corresponds to an approximate mass of 25378.5 Da can be observed. 
Since the peak is very broad, an accurate determination of molecular mass was not possible. The peak does not 
correspond to residual DNA template. First, the template DNA is digested, and second, the DNA strands would have 
a mass of 23036.0 and 23174 Da, respectively 

[0440] This example shows that T7 RNA polymerase can effectively amplify target DNA. The generated RNA can 
be detected by Mass spectrometry, in conjunction with modified (e.g. , 3*-deoxy)ribonucleotides that are specifically 
incorporated by a RNA polymerase but not extended any further, this method can be applied to determine the sequence 
of a template DNA. 

EXAMPLE 21 

MALDI Mass Spectrometry of RNA Endonuclease Digests 
MATERIALS 

[0441] Synthetic RNA (Sample A:5'-UCGGGUCUGAUGAGUCGGUGAGGAC-3'(SEQ ID 62); sample B:5'-GUGAC- 
UACAGGUGAGCUCGA-3' (SEQ ID NO 63); sample C:5'-CGAUGGGAGAGUAAGUAGUA-3' (SEQ ID NO, 64)) sam- 
ples were obtained from DNA technology (Aahus, Denmark) and purified on a denaturing polyacrylamide gel (Shaler, 
T. A. et ai. (1996) Anal. Chem. 63 :5766-579^ Rnases T^ (Eurogentec), (Calbiochem), A (Boehrlnger-Mannhelm) 
and PhyM (Pharmacia) were used without additional purification, Streptavi din-coated magnetic beads (Dynabeads M- 
280 Streptavidin, DynaJ) were supplied as a suspension of 6-7 x 10^ bead/ml (10 mg/ml) dissolved in phosphate- 
buffered saline (PBS) containing 0.1% BSA and 0.02% NaNg. 3-Hydroxypicolinic acid (3-HPA) (Aldrich) was purified 
by a separate desalting step before use as described in more detail elsewhere (Little, D. P. et ai (1995) Proc. Natl. 
Acad. Sci. U.S.A. 92 , 2318-2322). 

METHODS 

[0442] In vitro transcription reaction. The 5'-biotinylated 49 nt in vitro transcript (SEQ ID No. 65): 

AGGCCUGCGGCAAGACGGAAAQACCAUGGUCCCUNAUCUGCCGCAGGAUC 

was produced by transcription of the plasmid pUTMS2 (linearized with the restriction enzyme BamHI) with T7 RNA 
polymerase (Promega). For the transcription reaction 3 [ig template DNA and 50u T7 RNA polymerase were used in 
a 50 {i\ volume of 1 u/jil RNA guard (Rnax inhibitor, Pharmacia), 0.5 mM NTP's 1 .0 mM 5'-biotin-ApG dinucieotide, 40 
mM Tris-HCi (pH 8.0), 6 mM MgClj 2 mlVI spermidine and 1 0 mM DTT. Incubation was performed at 37*C for 1 hour, 
then another aliquot of 50 units T7 RNA polymerase was added and incubation was continued for another hour. The 
mixture was adjusted to 2M NH^ acetate and the RNA was precipitated by addition of one volume of ethanol and one 
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volume of isopropanol. The prectpttated RNA was coflGcted by centrifugation at 20,000 X g for 90 min at 4'C, the pellet 
was washed with 70% ethanol, dried and redissolved at 8 M urea. Further purification was achieved by electrophoresis 
through a denaturing poly acryl amide gel as described elsewhere (Shaler, T A. et aL (1 996) Anal. Chem. 68:576-579). 
The ration of B'-biotinylated to non-biotinylated transcripts was about 3:1 . 

[0443] Hibonuciease assay. For partial digestion with selected RNases different enzyme concentrations ad assay 
conditions were employed as summarized in table VII. The solvents for each enzyme were selected foilowing the 
suppliers' instructions. The concentrations of the synthetic RNA samples and the tn Wf«? transcript were adjusted to 
5-10 X lO-^M. 
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[0444] The reaction was stopped at selected times by mixing 0.6 ^tl allquots of the assay with 1 .5 |il of 3 HPA-solution. 
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The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis. 

[0445] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 jil) of 25% ammonium hydroxide and 
RNA sample (5-1 Ox 10"^ M) at 60**C, 1 ^1 aliquots were taken out at selected times and dried in a stream of cold air, 
For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1 ,5jj.l of the matrix 

5 solution and 0.7 |j.l of NH4+ loaded cation exchanged polymer beads were added. 

[0446] The reaction was stopped at selected times by mixing 0.6 p,l aliquots of the assay with 1 .5 |il of 3HPA-solution. 
The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis. 
[0447] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 |a.l) of 26% ammonium hydroxide and 
RNA sample (5-10 X 1 0"^ M) at 60'C. 1 |j.l aliquots were taken out at selected times and dried in a stream of cold air. 

^0 For these samples rt turned out to be important to first dry the digests in a stream of cold air, before 1 .5 \x\ of the matrix 
solution and 0.7 ^il if a suspension of MH^-^ loaded cation exchange polymer beads were added. 
[0448] Separation of 5'-biotinylated fragments. Steptavid in -coated magnetic beads were utilized to separate 5'-bioti- 
nylated fragments of the in vitro transcript after partial RNase degradation. The biotin moiety in this sample was intro- 
duced during the transcription reaction initiated by the 5'-biotin-pApG-dinucleotide. Priorto use, the beads were washed 

^5 twice with 2 x binding & washing (b&w) buffer (20 mM Tris-HCI, 2 mM EDTA, 2 M NaCI pH 8.2) and resuspended at 
1 0 mg/ml in 2 x b&w buffer. Circa 25 pmol of the RNA in vitro transcript were digested by RNase U2 using the protocol 
described above. The digestion was stopped by adding 3 ^il of 95% formamide containing 10 mM trans- 1 ,2-diamino- 
cyclohexane-N,N,N'',N''-tetraacetlc acid (CDTA) at 90^*0 for 5 min, followed by cooling on ice. Subsequently, capture 
of the biotinylated fragments was achieved by incubation of 6(xl of the digest with 6 |j,l of the bead suspension and 3|xl 

20 of b&w buffer at room temperature for 15 min. Given the blndir^g capacity of the beads of 200 pmol of biotinylated 
oligonucleotide per mg of beads, as specified by the manufacturer, the almost 2-times excess of oligonucleotide was 
used to assure a full loading of the beads. The supernatant was removed, and the beads were washed twice with 6|il 
of H2O. The CDTA and 95% formamide at 90*C for 6 min. After evaporation of the solvent and the formamide the ^.5 
pmol of fragments were resuspended in 2 \i\ H2O and analyzed by MALDI-MS as described above. 

25 [0449] Sampie preparation for MALDI-MS, 3-Hydroxypicolinic acid (3-HPA) was dissolved in ultra pure water to a 
concentration of ca. 300 mM. Metal cations were exchanged against as described in detail previously. (Little, D. 
P. etai (1 995) Proc. Natl. Acad. Sci. U.S.A. 92: 231 8-2322). Aliquots of 0.6 jil of the analyte solution were mixed with 
1 .5 \\X 3-HPA on a flat inert metal substrate. Remaining alkali cations, present in the sample solution as well as on the 
substrate surface, were removed by the addition of 0.7 ]il of the solution of NH4+ - loaded cation exchange polymer 

30 beads. During solvent evaporation, the beads accumulated in the center of the preparation, were not used for the 
analysis, and were easily removed with a pipette tip, 

[0450] instrument, A prototype of the Vision 2OO0 (ThermBioanaiysis, Heme!, Hempstead, UK) reflectron time of 
flight mass spectrometer was used for the mass spectrometry. Ions were generated by irradiation with a frequency- 
tripled ND:YAG laser (355 nm, 5 ns; Spektrum GmbH, Berlin, Germany) and acceJerated to 10 ke V. Delayed ion 

35 extraction was used for the acquisition of the spectra shown, as it was found to substantially enhance the signal to 
noise ratio and/or signal intensity. The equivalent flight path length of the system is 1.7 m, the base pressure is 10^ 
Pa. Ions were detected with a discrete dy node secondary-electron multiplier (R2362, Hamamatsu Photonics), equipped 
with a conversion dynode for effective detection of high mass ions. The total impact energy of the ions on the conversion 
dynode was adjusted to values ranging from 16 to 25 keV, depending on the mass to be detected. The preampllfied 

40 output signal of the SEM was digitized by a LeCroy 9450 transient recorder (LeCroy, Chestnut Ridge, NY, USA) with 
a sampling rate of up to 400 MHz. For storage and further evaluation, the data were transferred to a personal computer 
equipped with custom-made software (ULISSES). All spectra shown were taken in the positive ion mode. Between 20 
and 30 single shot spectra were averaged for each of the spectra shown. 

4S RESULTS 

[0451] Specificity of Hnases. Combining base-specific RNA cleavage with MALDI-MS requires reaction conditions 
optimized to retain the activity and specificity of the selected enzymes on the one hand and complying with the boundary 
conditions for MALDI on the other. Incompatibility mainly results because the alkaline-ion buffers, commonly used in 

50 the described reaction, such as Na-phosphate, Na-citrate or Na-acetate as well as EDTA interfere with the MALDI 
sample preparation; presumably they disturb the matrix crystallization and/or analyte incorporation. Tris-HCI or am- 
monium salt buffers, in contrast, are MALDI compatible (Shaler, T. A. etal. (1 996) Anal. Chem. 68:576-579). Moreover, 
alkaline salts in the sample lead to the formation of a heterogenous mixture of multiple salts of the analyte, a problem 
increasing with increasing number of phosphate groups. Such mixtures result in loss of mass resolution and accuracy 

55 as well as signal-to-noise ratio (Little, D. P. et ai (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322; Nordhoff, E., 
Cramer, R, Karas, M.,Hiltenkamp, R, Kirpekar, F., Kristiansen, K. and Roepstorff, P. (1993) Nucleic Acids Res., 21, 
3347-3357). Therefore, RNase digestions were can'ied out under somewhat modified conditions compared to the ones 
described in the literature. They are summarized above in table VII. For Rnase T^, A, CL3 ad Cusativin, Tris-HCI (pH 
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6-7.5) was used as buffer 20 mM DAC provides the pH of 5, recommended for maximum activity of RNases U2 and 
PliyM. The concentration of 10-20 mM of these compounds were found to not interfere significantly with the MALDI 
analysis. To examine the specificity of the selected ribonucieases under these conditions, three synthetic 20-25mer 
RNA molecules with different nucleotide sequences were digested. 

[0452] The MALDI -MS spectra of Figure 77 shows five different cleavage patterns (A-E) of a 25 nt RNA obtained 
after partial digestion with RNases T^^ U2, PhyM, A, and alkaline hydrolysis. These spectra were taken from aliquots 
which were removed from the assay after empirfcally determined incubation times, chosen to get an optimum coverage 
of the sequence. As the resulting samples were not fractionated prior to mass spectrometric analysis, they contain all 
fragments generated at that time by the respective RNases. In practice, uniformity of the cleavages, can be affected 
by a preferential attack on the specific phophodiester bonds (Donis-Keller, H., Maxam, A. M., and Gilbert, W. (1977) 
Nucleic Acids Res,, 4, 1 957-1978; Donis-Keller, H. {1 980) Nucleic Acids Res., 8 3133-3142). The majority of the ex- 
pected fragments are indeed observed in the spectra. It is also worth noting that for the reaction protocols as used, 
correct assignment of all fragment masses is only possible, if a 2', 3*-cyclic phosphate group is assumed. It is well 
known that such cyclic phosphates are intennediates in the cleavage reaction and get hydrolyzed in a second, inde- 
pendent and slower reaction step involving the enzyme (Richards, F. M., and Wycoff, H. W. in The Enzymes Vol 4, 
3rd Ed., (ed. Boyer, P.D.) 746-806 (1971 , Academic Press, New York); Helnemann, U and W. Saenger (1985) Pure 
Appl. Chem. 57, 417-422; Ikehara, M. elal., (1987) PureAppI, Chem. 59-965-96B) Vreslow, R. andXu, R. (1993) Proc. 
Nal. Acad. Sa\ USA, SO, 1201-1207). In a few cases different fragments have equal mass of differ by as little as 1 
Dalton., In these cases, mass peaks cannot unambiguously be assigned to one or the other fragments. Digestion of 
two additional different 20 nt RNA samples was, therefore, perfomned (Hahner, S., Kirpekar, F, Nordoff, E., Kristiansen, 
K., Roepstorff, P. and Hillenkamp, F. (1 996) Proceedings of the 44th ASMS Conference on Mass Spectrometry, Port- 
land, Oregon) in order to sort out these ambiguities. For all samples tested, the selected ribonucieases appear to 
cleave exclusively at the specified nucleotides leading to fragments arising from single as weli as multiple cleavages. 
[0453] In Figure 77, peaks, indicating fragments containing the original 5'-tennfnus. are marked by arrows. All non 
marked peaks can be assigned to internal sequences or those with retained 3'-terminus. For a complete sequence ali 
possible fragments bearing exclusiveiy either the 5'- or the 3'-terminus of the original RNA woufd suffice. In practice, 
the 5'-fragments are better suited for this purpose, because the spectra obtained after incubation of all three synthetic 
RNA samples contain the nearly complete set of originals of 5'-ions for ail different RNases (Hahner, S., Kirpekar, F, 
Nordoff, E., Kristiansen, K., Roepstorff and Hillenkamp, F (1 996) Proceedings of the 44th ASMS Conference on Mass 
Spectrometry, Portland, Oregon), internal fragments are somewhat less abundant and fragments containing the original 
3'-terminus appear suppressed in the spectra. In agreement with observations reported In the literature (Gupta, R, C. 
and Randerath, K. (1977) Nucleic Acids Res., 4, 1957-1978), cleavages close to the 3'-terminus were partially sup- 
pressed in partial digests of the RNA 25 mer by RNase T^^ and U2 (even if they are interna! or contain the originai 5'- 
terminus). Fragments from such cleavages appear as weak and poorly resolved signals in the mass spectra, 
[0454] For larger RNA molecules secondary structure is known to influence the uniformity of the enzymatic cleavages 
(Donis-Kelier, H.. Maxam, A, M. and Gilbert, A. (1977) Nucieic Acids Res, 8, 3133-3142). This can, in principle be, 
overcome by altered reaction conditions. In assay solutions containing 5-7 M urea, the activity of RNases such as T2, 
Ug. A, CI3, and PhyM is known to be retained (Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1977) Nucieic Acids 
Res,, 4, 2527-2537; BoguskI, M. S., Hieter. P.A., and Levy, C. C. (1980) J. BioL Chem,,2SS, 2160-2163; Donis-Kelier, 
H. (1980) Nucleic Acids Res,, 8, 3133-3142, while RNA is sufficiently denatured. UV-MALDI-anaiysIs with 3-HPA as 
matrix is not possible under such high concentrations of urea in the sample, Up to a concentration of 2 M urea in the 
reaction buffer, MALDI analysis of the samples was still possible although significant changes in matrix crystallization 
were observed. Spectra of the RNA 20 mer (sample B), digested in the presence of 2 M urea still resembled those 
obtained under conditions listed in Table VII. 

[0455] Digestion by RNases which exclusively recognize one nucleobase is desirable to reduce the complexity of 
the fragment patterns and thereby facilitate the mapping of the respective nucleobase. RNases CL3 andcursavitin are 
enzymes reported to cleave at cytidyfic acid residues. Upon limited RNase CL3 and cursativin digestion of the RNA- 
20mer (sample B) under non-denaturing conditions, fragments corresponding to cleavages at cytidylic residues were 
indeed observed (Figure 78). Similar to the data reported so far (Boguski, M. S., Hieter, P. A. and Levy, C. C. (1980) 
J, BioL Cftem.,255»2160-2163: Rojo, M. A., Arias, F J., Iglesias, R., Ferreras, J. M., Munoz, R., Escarmis, C, Soriano, 
F, Liopez-Fando, Mendez, E., and Girbes. T (1994) Plants, 194. 328-338), The degradation pattern in Figure 78, 
however, reveals that not every cytidine residue is recognized, especially for neighboring C residues. RNase CL3 is 
also reported to be susceptible to the influence of secondary structure (Boguski, M, S., Helter, R A., and Levy, C. C. 
(1980) J Chem., 255, 2160-2163), but for RNA of the size employed in this study, such an influence should be 
negligibJe. Therefore, unrecognized cleavage sites in this case can be attributed to a lack of specificity of this enzyme. 
To confimn these data, a further RNase CLa-digestion was performed with the RNA 20mer (sample C). As a result of 
the sequence of this analyte, all three linkages containing cytidylic acid were readily hydrolyzed, but additional cleav- 
ages at uridyJic acid residues were detected as well Since altered reaction conditions such as increased temperature 
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(90'C). various enzyme to substrate ratios, and addition of 2IW urea did not result in a digestion of tlie expected spe- 
cificity, application of this enzyme to sequencing was not pursued further. Introduction of a new cytldine-specific ribo- 
nuclease, cusativin, isolated form dry seeds of Cucumis sativus L looked promising for RNA sequencing (Rojo, M. A., 
Arias, F. J., Iglesias, R,, Ferreras, J. M., Munoz, R., Escarmis, C, Soriano, F., Llopez-Fando, J., Mendez, E.andGirbes, 
T. (1994) PtantSr 194, 32S-338). As shown in Figure 78, not every cytldine residue was hydrolyzed and additional 
cleavages occurred at uridyiic acid residues for the recommended concentration of the enzyme. RNases CLg and 
cusativin will, therefore not yield the desired sequence information for mapping of cytidine residues and their use was 
not further pursued. The distinction of pyrimldine residues can be achieved, however, by use of RNases with multiple 
specificities, such as Physarum poiycephalum RNase (cleaves ApN, UpN) and pancreatic RNase A (cleaves UpN, 
CpN) (see Figure 77). All 5*-termrnus fragments, generated by the monospecific RNase and apparent in the spectrum 
of Figure 77C were also evident in the spectrum of the RNase PhyM digest (Figure 77D). Five of the six uridilic cleavage 
sites could, this way, be uniquely Identified by this indirect method. In a next step, the knowledge of the uridine cleavage 
sites was used to identify sites of cleavage of cytidilic acid residues in the spectrum recorded after incubation with 
RNase A (Figure 77E), again using exclusively ions containing the original 5'-terminus. Two of the four expected cleav- 
age sites were identified this way. A few imitations are apparent from these spectra, if only the fragments containing 
the original 5'-tenninus are used for the sequence determination. The first two nucleotides usually escape the analysis, 
because their signals get lost in the low mass matrix background. Because of this, the con^espondtng fragments are 
missing in the spectra of the U- and C-specific cleavages. Large fragments with cleavage sites close to the 3'-temninus 
are often difficult to identify, particularly in digests with RNases T''- and Ug. because of their low yield (vide supra) and 
the often strong nearby signal of the non-digested transcript. Accordingly the cleavages In position 22 and 23 do not 
show up in the spectrum of the G-specific RNase T, (Figure 77A) and the cleavage site 24 cannot be identified from 
the spectra of the and PhyM digests (Figures 77 C and D). Also site 16 and 17 with two neighboring cytidilic acids 
cannot be identified in the RNase A spectrum of Figure 77E. These observations demonstrate that a detenni nation of 
exciusfvely the 5'-termlnus fragments may not always suffice and the information contained in the internai fragments 
may be needed for a full sequence analysis. 

[0456] Finally, limited alkaline hydrolysis provides a continuum of fragments (Figure 77B), which can be used to 
complete the sequence data. Again, the spectrum is dominated by ions of fragments containing the 5'-terminus, al- 
though the hydrolysis should be equal for ail phosphodiester bonds. As was true for the enzymatic digests, correct 
mass assignments requires one to assume that ail fragments have a 2', 3'-cycilc phosphate. The distribution of peaks, 
therefore, resembles that obtained after a 3'-exo nuclease digest (Pieles. U., Zurcher, W., Schar, M. and Moser, H. E., 
(1993) Nucleic Acids Res., 21 , 31 91 -31 96; Nordhoff, E. et al. (1 993) Book of Abstracts, 1 3*^ Internat. Mass Spectrom. 
Conf., Budapest p, 218; Kirpekar, F, Nordhoff, E., Kristiansen, K., Roepstorff, P., Lezius, A. Hahner. S., Karas, M. and 
Hiilenkamp, F (1994) Nucleic Acid Res., 22, 3866-3870). In principle, the alkaline hydrolysis alone could, therefore, 
be used for a complete sequencing. This is, however, only possible for quite small oiigoribonucieotides, because larger 
fragment ions, differing in mass by only a few mass units will not be resolved in the spectra and the mass of larger 
ions cannot be determined with the necessary accuracy of better than 1 Da, even if peaks are partiaify orfuliy resolved. 
The interpretation of the spectra particularly from digests of unknown RNA samples is substantially simplified, if only 
the fragments containing the original 5'-tenninus are separated out prior to the mass spectrometric analysis. A proce- 
dure for this approach is described in the folfowing section. 

[0457] Separation of S'-biotinylated fragments. Streptavidin-coated magnetic beads (Dynai) were tested for the ex- 
traction of fragments containing the original 5'-tenninus from the digests. Major features to be checked for this solid- 
phase approach are the selective immobilization and efficient eiution of biotinylated species, in preliminary experiments, 
a 5*-biotinyiated DNA (1 9 nt) and streptavidin were incubated and MALDI analyzed after standard preparation. Despite 
the high affinity of the streptavidin-biotin interaction, the intact complex was not found in the MALDI spectra. Instead, 
signals of the monomeric subunit of streptavidin and the biotpnyiated DNA were detected. Whether the complex dis- 
sociates in the acidic matrix solution (pKA 3) or during the MALDI desorption process, is not known. Surprisingly, if the 
streptavidin is immobilized on a solid surface such as magnetic beads, the same results are not observed. A mixture 
of two 5'-biotinytated DNA samples (1 9 nt and 27 nt) and two unlabeled DNA sequences ( 1 2 nt and 22 nt) were incubated 
with the beads. The beads were extracted and carefully washed before incubation in the 3-HPA MALDI matrix. No 
anaiyte signals could be obtained from these samples. To test whether the biotinylated species had been bound to the 
beads altogether, eiution fonn the extracted and washed beads was performed by heating at 90*C in the presence of 
95% formamide. This procedure is expected to denature the streptavfdin , thereby breaking the streptavidin/biotin com- 
plex. Figure 79Bshows the expected signals of thetwo biotinyfated species, provingthat release of the bound molecules 
in the MALDI process is the problem rather than the binding of the beads; Figure 79A shows a spectnjm of the same 
sample after standard preparation, showing signals of ail four anaiytes as a reference. Complete removal of the for- 
mamide after the eiution and prior to the mass spectrometric analysis was found to be important, otherwise crystalli- 
zation of the matrix is disturbed. Mass resolution and the signal-to-ncise ration in spectrum 798 are comparable to 
those of the reference spectrum. These results testify to the specificity of the streptavidin-biotin interaction, since no 
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or only minor signals of the non-biotinyiated anaiyte were detected after incubation witli the Dynal beads, increased 
suppression of nonspecific binding was reported through an addition of the detergent Tween-20 to the binding buffer 
(Tong, X. and Smith, L. M. (1992) Anal Chem., 64, 2672-2677). Although this effect could be confimned In this study, 
peal< broadening affected the quality of the spectra due to remaining amounts of the detergent. The necessity of an 
eiutlon step as a prerequisite for detection of the captured biotinylated species can be attributed to a stabiiizing effect 
of the compiex by the Immobliization of the streptavidin to the magnetic beads. 

[0458] For practicai application of this solid phase method to sequencing a maximum efficiency of binding and eiutton 
of biotinylated species is of prime importance. Among a variety of conditions investigated so far, addition of salts such 
as EDTA gave best results in the case of DNA sequencing by providing ionic strength to the buffer (Tong, X. and Smith, 
L. M. (1 992) AnaiChem,,M, 2672-2677). To examine such an effect on the solid-phase method, several salt additives 
were tested for the binding and elution of the 5'-biolinylated RNA in vitro transcript (49 nt). The results are shown in 
Figure 80. Judging from the relative intensity, signal-to-noJse ration, and resolution of the respective signals, a 95% 
formamide solution containing 10 mM CDTA (Figure SOD) is most efficient for the binding/elution. Since CDTA acts as 
a chelating agent for divalent cation, formation of proper secondary an tertiary structure of the RNA is prevented. An 
improved sensitivity and spectral resolution has been demonstrated under such conditions for the analysis of RNA 
samples by eiectrospray mass spectrometry (Limbach, P. A., Grain, P. F. and IVicCloskey, J. A. (1 995) J Am. Soc. Mass. 
Spectrom., 6, 27-39). The Improvement in the MALDI analysis is actually not very significant compared to the spectrum 
obtained for the solution containing formamide alone (Figure 81b), but the reproducibility for spectra of good quality 
was substantially improved for the CDTA/fomnamide solution. Thus in addition to the improved binding/elution, this 
additive may also improve the incorporation of the anaiyte into the matrix crystals. Unfortunately, a striking signal 
broadening on the high mass side was observed in case of fonnamide solutions containing EDTA, CDTA or 25% 
ammonium hydroxide. Since this effect is most prominent in case of 25% ammonium hydroxide and this agent was 
also used for adjusting EDTA and CDTA to their optimum pH, a pronounced NH3 adduct ion fomnation ca be assumed. 
[0459] The applicability of streptavidin-coated magnetic beads separation to RNA sequencing was demonstrated for 
the Rnase U2 digest of the 5'-biotinylated RNA in Wfrotranschpt (49 nt) (Figure 81 ). The entire fragment pattern obtained 
after incubation with Rnase Ug is shown is spectrum 81 A. Separation of the biotinylated fragments reduces the com- 
plexity of the spectrum (Figure 81 B) since only 6'-terminal fragments are captured by the beads. The signals in the 
spectrum are broadened and the increased number of signals in the low mass range indicate that even after stringent 
washing of the beads, some amounts of buffer and detergent used for the binding and elution remained. Further im- 
provements of the method are, therefore, needed. Another possible strategy for application of the magnetic beads Is 
the immobilization of the target RNA prior to RNase digestion by an elution of the remaining fragments for further 
analysis. Cleavage of the RNA was impeded in this case, as evidenced by a prolonged reaction time for the digest 
under othenwise identical reaction conditions. 

EXAMPLE 22 

Parallel DNA Sequencfng Mutation Analysis and MJcrosatellfta Analysis Using Prinners with Tags and Mass 
Spectrometric Detection 

[0460] This EXAMPLE describes specific capturing of DNA products generated irt DNA analysis. The capturing is 
mediated by a specific tag (6 to 8 nucleotides long) at the 5' end of the analysis product that binds to a complementary 
sequence. The capture sequence can be provided by a partially double stranded oligonucleotide bound to a solid 
support. Different DNA analysis (e.g., sequencing, mutation, diagnostic, microsatellite analysis) can be carried out in 
parallel, using, for example, a conventional tube or microtlter plate (MTP). The products are then specrfically captured 
and sorted out via the complementary identification sequence on the tag oligonucleotide. The capture oligonucleotide 
can be bound onto a solid support (e.g., silicon chip) by a chemical or biological bond. Identification of the sample is 
provided by the predefined position of the caputs otigonucteotide. Purification, conditioning and analysis by mass spec- 
trometry are done on solid support. This method was applied for capturing specific primers that had a 6 base tag 
sequence. 

MATERIALS AND METHODS 

Genomic DNA. 

[0461] Genomic DNA was obtained from healthy individuals. 
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PGR AmpUtication 

[0462] PGR amplifications of part of the p-globin gene were established using p2 d(CATTTGCTTCTGACACAACT 
Seq- ED. No, 66) as forward primer and pi 1 d{TCTCTGTCTCCACATGCCCAG Seq. fD. No. 67) as reverse primer. The 
total PGR volume was 50 jil including 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-l\1annhejm, Cal# 159594), 
1 .5 mM MgCtg. 0.2 mM dNTPs (Boehrlnger-Mannhelm, Cat# 1 277049), and 1 0 pmof of each primer. A specific fragment 
of the p-globin gene was amplified using the following cycling conditions: 5 min @ 94'C followed by 40 cycles of 30 
sec @ 94'G, 45 sec @ 53*C, 30 sec <^ 72'C, and a final extension of 2 min @ 72*0. Purification of the amplified 
product and removai of unincorporated nudeotides was carried out using the QIAquick purification kit (QIagen, Cat 
281 04). Onefifth of the purified product was used forthe primer oligo base extension (PROBE) or sequencing reactions, 
respectively. 

Primer ofigo base extension (PROBE) and sequencing reactions 

[0463] Detection of putative mutations in the human p-globin gene at codon 5 and 6 and at codon 30 and in the IVS- 
1 donor site, respectively, was done in parallel (FIGURE 82A). p-TAGI (GTGGTCCCATGGTGCACGTGAGTG Seq. 
ID, No. 68) served as primer to analyze codon 5 and 6 and P-TAG2 (CGCTGTGGTGAGGCCCTGGGCA Seq. ID. No. 
69) forthe analyses of codon 30 and the IVS-1 donor site. The primer oligo base extension (PROBE) reaction was 
done by cycling, using the following conditions: final reaction volume was 20 jii, p-TAGI primer (5 pmol), p-TAG2 primer 
(5 pmol), dCTP, dGTP, dTTP, (final concentration each 25 ^IWI), ddATP (final concentration 100 ^M) dNTPs and ddNTPs 
purchased from Boeringer-IWannheim, Cat# 1277049 and 1008382), 2 ^1 of lOx ThermoSequence buffer and 2.5 U 
ThermoSequenase (Amersham, GAT#E79000Y), The cycling program was as follows: 5 min @ 94*C, 30 sec @ 53*C, 
30 sec @ 72*C and a final extension step for 8 min @ 72*0. Sequencing was performed under the same conditions 
except that the reaction volume was 25 \i\ and the concentration of nucleotides was 250 m-M for ddfsJTP. 

Capturing using TAG sequence and sampie preparation 

[0464] The capture oligonucleotides cap-tag1 d(GACGACGACTGCTACCTGACTCCA Seq ID No. 70) and cap-tag2 
d(ACAGCGGACTGCTACCTGACTCCA Seq ID No. 71), respecttvely, were annealed to equimolar amounts of uni-as 
d{TGGAGTGAGGTAGCAGTC Seq ID No. 72) (FIGURE 82A). Each oJigonucieotide had a concentration of 10 pmol/ 
jxl in ddHaO and incubated for 2 min O 80"C and 5 min @ 37"C. This solution was stored at -SO^'C and aliquots were 
taken. 10 pmol annealed capture oligonucletides were bound to 10 |il paramagnetic beads coated with streptavidin 
(1 0 mg/mf; Dynaf, Dynabeads M-280 streptavidin Cat# 112.06) by rncubation for 30 min @ 37*C. Beads were captured 
and the PROBE or sequencing reaction, respectively, was added to the capture oligonucleotides. To facilitate binding 
of p-TAGI abd p-TAG2, respectively, the reaction was incubated for 5 min @ 25**C and forSO min @ 1 6*C. The beads 
were washed twice with ice cold 0.7 M NH4 Citrate to wash away unspecific bound extension products and primers. 
The bound products were dissolved by adding 1 ^lI DDHgO and Incubation for 2 min @ 65*C and cooling on ice. 0.3|.il 
of the sample were mixed with 0.3|al matrix solution (saturated S-hydroxypicoiinic acid, 10% molar ratio ammonium- 
citrate in acetonitrile/water (50/50. v/v)) and allowed to air dry. The sample target was automatically introduced into 
the source region of an unmodified Perspective Voyager MALDI-TOF operated in delayed extraction linear mode with 
5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (Mr(ca!c)) were 
calculated from atomic compositions; reported experimental Mr(Mr(exp)) values are those of the singly-pronated form. 

RESULTS 

[0465] Specific capturing of a mixture of extension products by a short complementary sequence has been applied 
to isolate sequencing and primer oligo base extension (PROBE) products. This method was used for the detection of 
putative mutations in the human p-globin gene at codon 5 and 6 and at codon 30 and lVS-1 donor site, respectively 
(FIGURE 82A). Genomic DNA has been amplified using the primers p2 and p1 1 . The ampfificatfon product was purified 
and the nucleotides separated. One fifth of the purified product was used for analyses by primer oligo base extension. 
To analyze both sites in asingle reaction, primers, p-TAG1 and P-TAG2, were used respectively. p-TAGI binds upstream 
of codons 5 and 6 and p-TAG2 upstream of codon 30 and the IVS-1 donor site. Extension of these primers was per- 
formed by cycling in the presence of ddATP and dCTP, dGTP and d.TTP. leading to specific products, depending on 
the phenotype of the individual. The reactions were then mixed with the capture oligonucleotides. Capture oligonucle- 
Qtides include the biotinylated capture primer cap-tag1 and cap-tag2, respectively. They have 6 bases at the 5" end. 
that are complementary to the 5' end of p-TAGI and P-TAG2, respectively. Therefore, they specifically capture these 
primers and the extended products. By annealing a universal oligonucleotide (uni-as) to the capture oligonucleotide, 
the capture primer is transformed into a partially double stranded molecule where only the capture sequence stays 
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singfe stranded (Figure 82). This molecule Is then bound to streptavldin coated paramagnetic particles, to which the 
PROBE or sequencing reaction, respectively Is added. The mixture was washed to bind only the specifically annealed 
ollgonucleatides. Captured oligonucleotides are dissolved and analyzed by mass spectrometry. 
[0466] PROBE products of one individual (Fig. 83) show a small peak with a molecular mass of 7282.8 Da. This 
corresponds to the unextended p-TAG1 that has a calculated mass of 7237.8 Da. The peal< at 8498.6 Da coTesponds 
to a product, that has been extended by 4 bases. This corresponds to the wildtype situation. The calculated mass of 
this product Is 8500.6 Da. There Is no significant peak indicating a heterozygote situation. Furthermore only p-TAGI 
and not p-TAG2 has been captured, indicating a high specificity of this method. 

[0467] Analyses of what was bound to cap-tag2 (Figure 84) shows only one predominant peak with a molecular 
mass of 9331 ,5 Da. This corresponds to an extension of 8 nucleotides. It Indicates a homozygous wildtype situation 
where the calculated mass of the expected product is 9355 Da. There is no significant amount of unextended primer 
and only P-TAG2 has been captured. 

[0468] To prove that this approach is also suitable for capturing specific sequencing products, the same two primers 
P-TAG1 and p-TAG2, respectively, were used. The primers were mixed, used in one sequencing reaction and then 
sorted by applying the above explained method. Two different termination reactions using ddATP and ddCTP were 
perfonmed with these primers (Figures 85 and 86, respectively). All observed peaks In the spectrograms correspond 
to the calculated masses in a wildtype situation. 

[0469] As shown above, parallel analysis of different mutations (e.g. , different PROBE primers) is now possible. 
Further, the described method is suitable for capturing specific sequencing products. Capturing can be used for sep- 
aration of different sequencing primers out of one reaction tube/well, Isolation of specific multiplex-amplified products, 
PROBE products, etc. Conventional methods, like cycle sequencing, and conventional volumes can be used. A uni- 
versal chip design permits the use of many different applications. Further, this method can be automated for high 
throughput. 

EXAMPLE 23 

Deletion Detection by Mass-Spectrometry 

[0470] Various fomnats can be employed for mass spectrometer detection of a deletfon within a gene. For example, 
molecular mass of a double standard amplified product can be detemiined, or either or both of the strands of a double 
stranded product can be isolated and the mass measured as described in previous examples. 

[0471] Alternatively, as described herein, a specific enzymatic reaction can be perfomried and the mass of the cor- 
responding product can be determined by mass spectrometry. The deletion size can be up to several tens of bases in 
length, still allowing the simurtaneous detection of the wildtype and mutated alfele. By simultaneous detection of the 
specific products, it Is possible to identify in a single reaction whether the individual is homozygous or heterozygous 
for a specific allele or mutation. 

MATERIALS AND METHODS 

Genomic DNA 

[0472] Leukocyte genomic DNA was obtained from unrelated healthy individuals. 
PGR ampiification 

[0473] PCR amplification of the target DNA was established and optimized to use the reaction products without a 
further purification step for capturing with streptavidin coated beads. The primers for target amplification and for PROBE 
reactions were as follows: 

[0474] CKRA-F:d{CAG CTC TCA TTT TOO ATA C SEQ ID. NO. 73) and CKRA-R bio: d(AGC CCC AAG ATG ACT 
ATC SEQ ID. NO. 74). CKR-5 was amplified by the following program; 2 min @ 94"C, 46 seconds @ 52'*C, 5 seconds 
@ 72*C, and a final extension of 5 minutes at 72'C. The final volume was 60 |jil including 200 ng genomic DNA 1 U 
Taq-polymerase (Boehringer-IWannheim, Gat # 1596594), 1.5 Mm MgCI^, 0.2 Mm DNTPS (Boehringer-Mannheim, Cat 
# 1277049), 10 pmol of unmodified forward primers, and 8 pmol 5' biotinylated reverse primer. 

Capturing and Denaturation of Biotinylated Templates 

[0475] 10 \i\ paramagnetic beads coated with streptavidin (10 mg/ml; Dynal. Dynabeads M-2B0 streptavidin Cat # 
112.06) in 5x binding solution (5M NH4CI. 0.3 M NH4OH) were added to 45 ^1 PCR reaction (5 \l\ of PCR reaction were 
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saved for electrophoresis). After binding by incubation for 30 min. at 37'C the supernatant was discarded. Captured 
templates were denatured with 50 jil of 1 00 Mm NaOH for 5 min. at ambient temperature, washed once with 50 \i\ 50 
Mm NH4OH and three times with 100 |xl 10 Mm Tris/CI, Ph 8.0. The single stranded DNA served as templates for 
PROBE reactions. 


Primer Ofigo Base Extension (PROBE) Reaction 


[0476] The PROBE reaction was perfomied using Sequence 2.0 (USB Cat # E70775Z Including buffer). dATP/DGTP 
and ddTTP were supplied by Boehringer-Mannheim (Cat # 1 277049 and 1 008382). d{CAG CTC TCA TTT TCC ATA 
C (SEQ ID. NO. 73) was used as PROBE primer (Figure 87). The following sofutions were added tot he beads: 3.0 ^.i 
H2O, 1.0^1 reaction buffer, 1.0 jxl PROBE primer (10 pmoi) and incubated at 65*0 for 5 minutes followed by 37^Cfor 
10 min. Then 0.5 |^l DTT, 3.5 ^1 DNTPS/ddntp each 50 \iM and 0.5 j^l Sequenase (0.8 U) were added and incubated 
at37'C for 10 min. 


T4 Treatment of DNA 


[0477] To generate blunt ended DNA, amplification products were treated with T4 DNA polymerase (Boehringer- 
Mannheim Cat# 1 004786). The reactions were carried out according to the manufacturer's protocol for 20 min. at 11 **C. 

Direct Size Determination of Extended Products 

[0478] To determine the size of the amplified product, MALDJ-TOF was applied to one strand of the amplification 
product, samples were bound to beads, as described above, conditioned and denatured, as described below 

DNA Conditioning 

[0479] After the PROBE reaction the supernatant was discarded nd the beads were washed first in 50 700 mM 
NH^^cftrate and second 50 ^li 50 mM NH^-citrate. The generated diagnostic products were removed for the template 
by heating the beads in 2 \ii H^O at 80'C for 2 min. The supernatant was used for MALDI-TOF anatysis. 

Sampie Preparation and Anaiysis with MALDt-TOF Mass Spectrometry 

[0480] Sample preparation was performed by mixing 0.6 ^1 of matrix solution (0.7 M 3-hydroxypicolinic acid 0 07 M 
dibasic citrate in 1 :1 H^OiCHgCN) with 0.3 ^1 of diagnostic PROB^ products In water on a sample target and'ailowed 
to air dry. Up to 100 samples were spotted on a probe target disk fonntroduction into the source region of an unmodified 
Perspective Voyager MALDI-TOF instrument operated in linear mode with delayed extraction and 5 and 30 kV on the 
target and conversion dynode. respectively. Theoretical average molecular mass CM^Ccaic)) of analytes were calculated 
from atomic compositions, reported experimental M,(Mr(exp)) values are those of the singly-pronatedform. determined 
using internal calibration with unextended primers in the case of PROBE reactions. 


Conventional Analyses 


[0481] Conventional analyses were performed by native polyacrylamide gel electrophoresis seconding to standard 
protocols. The diagnostic products were denatured with formamide prior to loading onto the gels and stained with 
ethidium bromide or silver, respectively. 


RESULTS 


[0482] The CKR-5 status of 1 0 randomly chosen DNA samples of healthy individuals were analyzed. Leukocyte DNA 
was amplified by PGR and an aliquot of the amplified product was analyzed by standard polyacrylamide gel electro^ 
phoresis and silver staining of the DNA (Figure BS). Four samples showed two bands presumably indicating hetero- 
zygosity for CKR-5, whereas the other 6 samples showed one band, corresponding to a homozygous gene (Figure 
88). fn the case where two bands were observed, they correspond to the expected size of 75 bp for the wildtype gene 
and 43 bp for the allele with the deletion (Figure 87). Where one band was observed, the size was about 75 bp which 
rndicated a homozygous wildtype CKR-5 allele. One DNA sample derived from a presumably heterozygous one from 
a homozygous individual were used for all further analysis. To detemnine the molecular mass of the amplified product, 
DNA was subjected to matrix assisted laser desorption/ionization coupfed with time of flight analysis (MALDI-TOF)' 
Double stranded DNA, bound to streptavidln coated paramagnetic particles, was denatured and the strand released 
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into the supernatant was analyzed. Figure 89A shows a spectrograph of a DNA sample, that was supposed to be 
heterozygous according to the result derived by potyacrylamide gel electrophoresis (Figure 88). The calcutated mass 
of the sense strand for a wildtype gene is 23036 Da and for the sense strand carrying the deletion allele 1 3143 (Figure 
87 and Table VI). Since many thermostable polymerases unspecifically add an adenosine to the 3' end of the product, 
those masses were also calculated. They are 23349 and 1 3456 Da. The masses of the observed peaks (Figure 89A) 
are 23119 Da, which corresponds to the calculated mass of a wlldtype DNA strand where an adenosine has been 
added (23349 Da). Since no peak with amass of about 23036 Da was observed, the polymerase must have qualitatively 
added adenosine. Two peaks, which are close to each other, have a mass of 13451 and 13137 Da. This corresponds 
to the calculated masses of the allele, with the 32bp deletion. The higher mass peak corresponds to the product, where 
adenosine has been added and the lower mass peak to the one without the unspecific adenosine. Both peaks have 
about the same height, indicating that to about half of the product adenosine has been added. The peak with a mass 
of 11682 Da is a doubly charged molecule of the DNA corresponding to 23319 Da (2 x 11682 Da - 23364 Da). The 
peaks with masses of 6732 and 6575 Da are doubly charged molecules of the one with masses of 13451 and 13137 
Da and the peak with 7794 Da corresponds to the triply charged molecule of 2331 9Da. Multfpie charged molecules 
are routinefy identified by calculation. Amplified DfvIA derived from a homozygous individual shows fn the spectrograph 
(Figure B9C) one peak with a mass 23349.6 and a much smaller peak with a mass of 23039.9 Da. The higher mass 
peak corresponds to DNA resufting from a wildtype allele with an added adenosine, that has a calculated mass of 
23349 Da. The lower mass peak corresponds to the same product without adenosine. Three further peaks with a mass 
of 11686, 7804.6 and 5852.5 Da correspond to doubly, triply and quadruply charged molecules. 
[0483] The unspecific added adenine can be removed from the amplified DNA by treatment of the DNA and T4 DNA 
polymerase. DNA derived from a heterozygous and a homozygous individual was analyzed after T4 DNA polymerase 
treatment. Figure B9B shows the spectrograph derived from heterozygous DNA. The peak corresponding to the wildtype 
strand has a mass of 23008 Da indicating that the added adenine had been removed completely. The same is observed 
for the strand with a mass of 13140 Da. 

The otherthree peaks are multiply charged molecules of the parent peaks. The mass spectrograph forthe homozygous 
DNA shows one peak that has a mass of 23004 Da, corresponding to the wildtype DNA strand without an extra adenine 
added. All other peaks are derived from multiply charged molecules of this DNA. The amplified products can be analyzed 
by direct detemnination of their masses, as described above, or by measuring the masses of products, that are derived 
from the amplified product in a further reaction. In this "primer oligo base extension (PROBE)" reaction, a primer that 
can be internal, as It is In the nested PGR, or identical to one of the PGR primers, is extended for just a tew bases 
before the termination nucleotide is incorporated. Depending on the extension length, the genotype can be specified. 
GKRA-F was used as a PROBE primer, and dATP/dGTP and ddTTP as nucleotides. The primer extension is AGT fn 
case of a wUdtype template and AT in case of the deletion (Figure 87). The corresponding masses are 6604 Da forthe 
wildtype and 6275 Da for the deietion, respectively. PROBE was applied to two standard DNAs. The spectrograph 
(Figure 90A) shows peaks with masses of 6604 Da corresponding to the wildtype DNA and at 6275 Da corresponding 
to the GKR-5 deletion allele (Table VIII). The peak at a mass of 5673 Da corresponds to CKRA-F (calculated mass of 
5674 Da). Further samples were analyzed in analogous way (Figure 90B). It Is unambiguously identified as homozygous 
DNA, since the peak with a mass of 6607 Da corresponds to the wildtype allele and the peak with a mass of 5677 Da 
to the unextended primer. No further peaks were observed. 

[0484] The example demonstrates that deletion analysis can be performed by mass spectrometry. As shown herein, 
the deletion can be analyzed by direct detection of single stranded amplified products, or by analysis of specifically 
generated diagntic products (PROBE). In addition, as shown in the following Example 26, double stranded DNA am- 
plified products can be analyzed. 


Size 

Calculated Mass 

Measured Mass 

wildtype w/o A 

23036 

23039/23009/23004 

wildtype with A 

23349 

23319/23350 

deletion w/o A 

13143 

13137/13139 

deletion with A 

13456 

13451 

PROBE 



wildtype 

6604 

6604/6608 

deletion 

6275 

6275 


All masses are in Dafton. 
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EXAMPLE 24 
Pentaptex tc-PROBE 

SUMMARY 

[0485] The multiplexing of themnocycling primer oligo base extension (tc-PROBE) was performed using five poly- 
morphic sites In three drfferent apolipoprotein gerres, which are thoifght to be involved In the pathogenesis of athero- 
sclerosis. Theapollpoprotein A IV gene (codons 347 and 360), the apolipoprotein E gene (codons 112 and 158), and 
the apolipoprotein B gene (codon 3500) were examined. All mass spectra were easy to interpret with respect to the 
five polymorphic sites. 

MATERIALS AND METHODS 

PGR Ampiification 

[0466] Human leukocytic genomic DNA was used for PGR. Listed below are the primers used for the separated 
amplification of portions of the Apo A fV, Apo E and the Apo B genes: 


Apo A IV: A347F: 5'-CGA GGA GOT CAA GGC CAG AAT-3' 

(SEQ 10 NO. 75) 

A360 R-2-bio: *5'-CAG GGG CAG CTC AGC TCT 

(SEQ ID NO. 76) 

Apo E: ApoE-F: B'-GGC ACG 6CT GTC CAA GGA-3' 

(SEQ ID NO. 77) 

ApoE-R bio; *5'-AGG CCG CGC TCG GCG CCC TC-3' 

(SEQ ID NO. 78) 

Apo B: ApoB-F2 bio: *5^CTT ACT TGA ATT CCA AGA GC-3' 

(SEQ ID NO. 79) 

Apo e-R: B'-GGG CTG ACT TGC ATG GAC CGG A-3' 

(SEQ ID NO. 80) 

• bfotinylated 

[0487] Taq polymerase and lOx buffer were purchased from Boehringer-Mannheim (Gennany) and dNTPs for Phar- 
macia (Freiburg, Germany). The total PGR reaction volume was including 10 pmol of each primer and 1 0% DMSO 
(dimethylsu If oxide, Sigma) (no DMSO for the PGR of the Apo B gene), with -200 mg of genomic DNA used as template 
and a final dNTP concentration of 200 \M. Solutions were heated to BO'^C before the addition of 1 U Taq polymerase; 
PGR conditions were: 5 min at gS^C. followed by 2 cycles 30 sec 94"G, 30 sec 62**C, 30 sec 72*C, 2 cycles 30 sec 
94^'C 30 sec 5B*C, 30 sec 72'*C, 35 cycles of 30 sec at 94*C, 30 sec at 56**C, 30 sec at 72'C, and a final extension 
time of 2 min at 72'C. To remove unincorporated primers and nucleotides, amplified products were purified using the 
"QIAquick" (Qiagen, Gemnany )kit, with elution of the purified products in 50^L of TE buffer (10 mM Tris-HCI 1mM 
EDTA, pH B.O). 

Binding of the ampiified product on beads 

[0488] 1 0 111 of each purified amplified product was bound to DynaBeads (Dynal, M-2S0 Streptavidm) and dena- 
tured according to the protocol from Dynal. For the pentaplex tc-PROBE reaction the three different amplified product 
(bound on the beads) were pooled. 

Tc-PROBE 

[0489] For the PROBE reaction the following primers were used: 
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(Apo A) P347: 5'-AGC CAG GAC AAG-3' (SEQ ID NO. 81) 
(Apo A) P360: 5'-ACA GCA GGA ACA GCA-3' (SEQ ID NO. 82) 
(Apo E) P112: 5'-GCG GAC ATG GAG GAC GTG-3' (SEQ ID NO. 63) 
(Apo E) P158: 5'-GAT GCC GAT GAC CTG CAG AAG-3'(SEQ ID NO. 84) 
(Apo B) P3500: 5'GTG CCCTGC AGC TTC ACT GAA GAC-3'(SEQ ID NO. 85) 

[0490] The tc-PROBE was carried out in a final volunne of 25 ^il containing 10 pmol of each prinner listed above, 2.5 
U Themioquenase (Amersham), 2.5 |iL Thermoquenase buffer, and 50 |xM dTTP (final concentrations) and 200 |iM 
of ddA/C/GTP, respectively. Tubes containing the mixture were placed in a thennocycler and subjected to the following 
cycling conditions: denatirration (94*'C) the supernatant was carefulJy removed from the beads and 'desalted' by ethanol 
precipitation to exchange nonvolatile cations such as Na+ and K+ with NH4+, which evaporated during the ionization 
process; 5 ^iL 3M ammonium acetate (pH 6.5) 0.5 |iL glycogen (1 0 mg/mL, Sigma), 25 HgO, and 11 0 ^iL absolute 
ethanol were added to 25 ^iL PROBE supernatant and incubated for 1 hour at 4*C. After a 10 min. centrifugation at 
13,000 X g, the pellet was washed in 70% ethanol and resuspended in 1 18 Mohnn/cm H^O. A 0.35 ^iL aliquot of 
resuspended DNA was mixed with 0.35 |j,L matrix solution (0.7 M 3-hydroxypicoiinic acid (3-HPA), 0.07 M ammonium 
citrate in 1 :1 H20:CH3CN) on a stainless steel sample target disk and allowed to afr dry preceding spectrum acquisition 
using the Themno Bioanalysis Version 2000 MALDI-TOF operated In reflectron mode with 5 and 20 kV on the target 
and conversion dynode, respectively, Theorettcal average moiecutar masses (iVI^(calc)) of the fragments were calcu- 
lated from atomic compositions. External calibration generated from synthetic (ATCG)n oligonucleotide (3.6-1 8kDa) 
was used. Positive ion spectra from 1-37500 Da were collected. 

RESULTS 

[0491] Table VIM shows the calcuJated molecuiar masses of all possible extension products including the mass of 
the primer itself. Fig. 91 shows a respective MALDI-TOP MS spectra of a tc-PROBE using three different templates 
and 5 different PROBE primers simultaneously in ne reaction. Comparison of the observed and calculated masses 
(see table VIII) allows a fast genetic profiling of various polymorphic sites fn an individual DNA sample. The sample 
presented in Figure 91 is homozygous for threonine and glutamine at position 347 and 360, respectively, in the apol- 
ipoprotein A IV gene, bears the epsilon 3 allele homozygous in the apolipoprotein E gene, and is also homozygous at 
the codon 3500 for arginine in the apofipoprotein B gene. 


TABLE VIII 



SEQ ID 

mass 

allele 

Apolipoprotein A IV 

5'-AGCCAGGACAAG-3' (347) 

86 

3688.40 

unextended 
primer 

5'-AGCCAGGACAAGTC-3' 

87 

4265.80 

347Ser 

5'-AGCCAGGACAAGA-3' 

88 

3935.60 

347Thr 

5 AC AGCACCA AC AGCA-3' (360) 

89 

4604.00 

unexter^ded 
primer 

5'-ACAGCAGGAACAGCATC-3' 

90 

5181.40 

360His 
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SEQ ID 

mass 

alfele 

5'-ACAGCAGGAACAGCAG'3' (112) 

91 

4917,20 

360Gln 

Apolipoprotein E | 

5'-GCGGACATGGAGGACGTG-3' (1 1 2) 

92 

5629.60 

unextended 
primer 

5'-GCGGACATGGAGGACGTGGC-3' 

93 

6247,00 

1 1 2Cys 

1 5'^GCGGACATGGAGGACGTGC-3' 

94 

5902.80 

112Arg 

5'-GATGCCGATGACCTGCAGAAG-3'n 58) 

95 

6480,20 

unextended 1 
primer 

5'-GATGCCGATGACCTGCAGAAGC-3' 

96 

6753.40 

158Arg 

5'-GATGCCGATGACCTGCAGAAGTG'3' 

97 

7097,60 

1 58Cys 

Apolipoprotein B-100 | 

5'-GTGCCCTGCAGCTTCACTGAAGAC-3' 
{3500) 

98 

7313.80 

unextended U 
prfmer || 

5'-GTGCCCTGCAGCTTCACTGAAGACTG-3' 

99 

7931.20 

3500Gln 1 

5 ' -G TG CC CTG C AG CTTC ACTG AA G ACC- 3 ' 

100 

7587.00 

3500Arg | 


EXAMPLE 25 

Sequencing Exons 5 to 8 of the p53 Gene by MALDI-TOF Mass Spectrometry 

MATERIALS & METHODS 

[0492] Thirty-five cycles of PCR reactions were performed In a 96 well microliter plate with each well containing a 
total volume of 50 i^f including 200 ng genomic DNA, 1 unit Taq DN A polymerase, 1 .5 mM MgCI^, 0.2mM dNTPx 10 
pmof of the forward primer and 6 or 8 of the biotinylated reverse prinner. The sequences of PCR primers prepared 
accordjng to established chemistry (N.D. Sinha, J. Biernat, H. Kter, Tetrahed. Lett. 24:5843-5846 (1 983) are as follows- 
exon 5:d(bfotin-TATCTGTTCACTTGTGCCC SEQ ID NO. 101) and d{brotin-CAGAGGCCTGGGGACCCTG SEQ ID 
NO. 102); exon 6: D(ACGACAGGGCTGGTTGCC SEQ ID NO. 103) and d(biotin-ACTGACAACCACCCTTAAC SEQ 
ID NO. 1 04); exon 7: d(CTGCTTGCCACAGGTCTC SEQ ID NO. 105) and d(biotin-CACAGCAGGCCAGTGTGC SEQ 
ID NO, 106; exon 8: d(GGACCTGATTTCCTTACTG SEQ ID NO, 107) and d(biotin-TGAATCTGAGGCATAACTG SEQ 
ID NO. 108). 

[0493] To each well of the 96-wefl microliter plate containing unpurifled amplified product. 0.1 mg of paramagnetic 
streptavidin beads (Dynal) in 1 0 ^1 of 5 x binding solution (5 M NH4OH) was added and incubated at 3 7**C for 30 min 
Then beads were treated with 0.1 IW NaOH at room temperature for 5 min followed by one wash with 50 mM NH-OH 
at room temperature for 5 min followed by one wash with 50 mM Tris-HCI. 

[0494] Four dideoxy termination reactions were carried out in separate wells of the microliter plate A total of 84 
reactions (21 primers x4 reactions/primer) can be perfonned in a single microliter plate. To each well containing im- 
mobilized smgle-stranded template, a total volute of 10 ^1 reaction mixture was added including 1x reaction buffer 1 0 
pmol of sequencing primer, 250 mM of dNTPs. 25 mM of one of the ddNTPs, and 1-2 units of Thermosequenkse 
(Amersham). Sequencing reactions were carried out on a thermal cyder using non-cycfing conditions" 80**C 1 min 
50-C. 1 min. 50^'C to 72*0, ramping 0.1 Vsec. and 72'*C, 5 min. The beads were then washed with 0.7 M ammonium 
crtrate followed by 0.05 M ammonium citrate. Sequencing products were then removed from beads by heating the 
beads to &0^C in 2 ^1 of 50mM NH4OH for 2 min. The supernatant was used for MALDI-TOF MS analysis 
[0495] Matrix was prepared as described rn Kter. et ai (Kter, H. etaL, Nature Biotechnol. 14: 1123-1128 (1996)) This 
saturated matrix solution was then diluted 1 .52 times with pure water before use. 0.3 nl of the diluted matrix solution 
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was then diluted 1 .52 times with pure water before use. 0.3 \i\ of the diluted matrix solution was loaded onto the sample 
target and allowed to crystallize followed by addition of 0,3 \l\ of the aqueous analyte. A Perseptive Voyager DE mass 
spectrometer was used for the experiments, and the samples were typically analyzed in the manual mode. The target 
and middle plate were kept at + 18,2 kV for 200 nanoseconds after each laser shot and then the garget voltage was 
raised to + 20kV. the ion guide wire in the fifght tube was kept at - 2V. Normaify, 250 laser shots were accumulated foe 
each sampte. ^The originaf spectrum was acquired under 500 MHz digitizing rate, and the final spectrum was smoothed 
by a 455 point average (Savitsky and Golay, (1964) Analytical Chemistry, 36:1627). Default calibration of the mass 
spectrometer was used to identify each peak and assign sequences. The theoretical mass values of two sequencing 
peaks were used to recalibrate each spectrum. (D.R LIttie, T.J. Cornish, M.J. O'Donnel, A. Braun, R.J. Cotter, H. Kter. 
Anal. Cham., submitted). 

RESULTS 

[0496] Alterations of the p53 gene are considered to be a critical step in the development of many human cancers 
(Greenblatt, et aL, (1994) Cancer Res. 54, 4855-4878; C.C. Harris, (1996) J. Cancer, 73, 261-269; and D. Sidransky 
and M. Hoiistein, (1996) Annu.Res.Med., 47,285-301). Mutations may serve as molecular Indicators of clonal rty or as 
eariy markers of relapse in a patient with a previously identified miftation In a primary tumor (Hainaut, et al., (1997) 
Nucleic Acid Res., 25, 151-157). The prognosis of the cancer may differ according to the nature of the p53 mutations 
present (H.S. Goh et al., (1995) Cancer Res, 55, 5217-5221). Since the discovery of the p53 gene, more than 6000 
different mutations have been detected. Exons 5-8 were selected as sequencing targets where most of the mutations 
cluster (Hainautet aL (1997) Nucleic Acids Res., 25, 151-7). 

[0497] Figure 96 schematically depfcts the single tube process for target amplification and sequencing, which was 
performed, as described in detail in the Materials and Methods. Each of exon 5-8 of the p53 gene was PGR amplified 
using flanking primers in the Intron region; the down stream primer was biotinylated. Amplifications of different exons 
were optimized to use the same cycling profile, and the products were used without further purification. PCR reactions 
were performed in a 96 welf microHter plate and the product generated in one well was used as the tempiate for one 
sequencing reaction. Streptavldin-coated magnetic beads were added to the same microfiter plate and amplified prod- 
ucts were immobilized. The beads were then treated with NaOH to generate immobilized sfngle-stranded DMA as 
sequencing template. The beads were washed extensively with Trls buffer since remaining base wouid reduce the 
activity of sequencing enzyme. 

[0498J A total of 21 primers were selected to sequence exon 5-8 of the p53 gene by primer walking. The 3'-end 
nucleotide of all the primers is located at the site where no known mutation exists. Four termination reactions were 
perfonned separately which resulted in a total of 84 sequencing reactions on the same PCR microliter plate. Non- 
cycling conditions were adopted forsequencing since streptavidin coated beads do not tolerate the repeated application 
of high temperature. Sequencing reactions were designed so that mt terminated fragments were under 70 nucleotides, 
a size range easily accessible by MALDI-TOF MS and yet long enough to sequence through the next primer binding 
site. Thermeq uenase was the enzyme of choice since it could reproducible generate a high yield of sequencing products 
in the desired mass range. After the sequencing reactions, the beads were washed with ammonium ion buffers to 
replace all other cations. The sequencing ladders were then removed from the beads by heating in ammonium hydrox- 
ide solution or simply in water. 

[0499] A sub-microliter alfquot of each of the 84 sequencing reactions was loaded onto one MS sample holder con- 
taining preloaded matrix. Figure 94 gives an example of sequencing data generated from one primer; four spectra are 
superimposed. 

[0500] Ail sequencing peaks were well resolved in the mass range needed to read through the next sequencing 
primer site. Sometimes doubly charged peaks were observed which could be easily identified by correlating the mass 
to that of the singly charged ion. False stops generated by early temijnation of the enzymatic extension can be observed 
cle to the primer site. Sfnce the mass resolution is high enough, it is easy to differentiate the false stop peaks from the 
real sequencing peaks by calculating the mass difference of the neighboring peaks and crs comparing the fourspectra. 
Additionally, mt primers generated detectable data through the region of the downstream primer binding site thereby 
covering the false stop region. 

[0501] Using optimized procedures of amplification, sequencing, and conditioning, exons 5-8 of the p53 gene were 
successfully sequenced. Correct wildtype sequence data were obtained from all exons with a mass resolution about 
300 to BOO over the entire mass range. The overall mass accuracy is 0.05% or better. The average amount of each 
sequencing fragment loaded on the MS sample holder is estimated to be 50 fmol or less. 

[0502] This example demonstrates thefeasibility of sequencing exons of a human gene by MALDI-TOF MS. Compare 
to gef-based automated fluorescent DNA sequencing, the read lengths are shorter. Microchip technology can be in- 
corporated to provide for parallel processing. Sequencing products generated in the microtiter plate can be directly 
transferred to a microchip which selves as a launching pad for MALDI-TOF MS analysis. Robot-driven serial and 
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parallel nanoliter dispensing tools are being used to produce 1 00-1 000 element DNA arrays on < 1 " square chips with 
flat or gGometrically altered (e.g. , with wells) surfaces for rapid mass spectronnetric analysis, 

[0503] Figure 94 shows an MS spectrunn obtained on a chip where the sampie was transferred from a microtrter 
plate by a pintoot. The estimated amount of each termination product loaded is 5 fmol or less which is in the range of 
amounts used in conventional Sanger sequencing with radiolabeled or fluorescent detection (0.5-1 fmol per fragment). 
The low volume MALDI sample deposition has the advantages of miniaturization (reduced reagent cts), enhanced 
reproducibility and automated signal acquisition. 

EXAMPLE 26 

Direct detection of syntlietic and biologically generated doubie-stranded DNA by MALDI-TOF MS 
Introduction 

[0504] Typically, matrix-associated laser desorptlon/ionization (Karas, et. al., (1 989) int. J. Mass Spectrom, Ion Proc- 
esses, 92, 231 ) time-of-fiight mass spectrometry (MALDI-TOF MS) of DNA molecules which are double stranded (ds) 
in solution yields molecular ions representative of the two single stranded components (Tang, et ai (1994) Rapid 
Commun. Mass Spectrom. 8:1 83; Tang, etai. (1995) Nucleic Acids Res. 23:3126; Benner, etaL (1995) Rapid Commun. 
MassSpectrom. 9:537; Liu, etai (1995) Anal. Chem. 67:3482; Siegert et af. (1996) Anal. Biochem. 243 :55; and Dok- 
tycz, etai (1 995) Anal. Biochem. 230 :205); this has been observed in severai reports dealing with biologicaily generated 
DNA from a polymerase chain reaction (PGR) amplification (Tang, etai. (1994) Rapid Commun. Mass Spectrom. 8: 
183; Liu. ef a/. (1995) Anai. Chem. 67 :3482; Siegert etai (1996) Anal. Biochem. 243 :55; and Doktycz, efa/-(1995) 
Anai. Biochem. 230:205). It is not clear whether the double strand is destabilized because of the decreased pH in the 
matrix environment or because of absorbance by the duplex during desorption/i on izati on/acceleration of an energy 
sufficient to overcome the attractive van der Waals and "stacking" stabilization forces (Cantor and Shimmei, Biophysical 
Chemistry Part \: The conformation of Biomolecules . W.H. Freeman, New York, (1980), 176). When analyte is present 
at high concentrations fonnation of non-specific gas-phase DNA multimers is, as with proteins (Karas, et. ai., (1989) 
int. J. MassSpectrom, Ion Processes 92:231). common; however, Lecchiand Pannell (Lecchi etai (1995) J. Am. Soc. 
Mass Spectrom. 6:972) have provided strong evidence for specific V^stson Crick (WC) base pairing being maintained 
in the gas phase. They detected these specific dimers when using 6-aza-2-thiothymlne as a matrix, but did not observe 
them with 3-hydroxypicclinic acid (3-HPA) or 2,4,6- hydroxyacetophenone matrix. As described below, by using a low 
acceleration voltage of the ions and preparing samples for MALDI analysis at reduced temperatures, routine detection 
of dsDNA is possible. 

MATERIALS AND METHODS 

[0505] Synthetic DNA. Oligonucleotides were synthesized (Sinha, et ai. (1 984) Nucteic Acids Res., 12, 4539) on a 
Perspective Expedite DNA synthesizer and reverse phase HPLC purified in-house. Sequences were: 50-mer (15337 
Da): 5'-TTG CGT ACA CAC TGG CCG TCG TTT TAC AAC GTC GTG ACT GGG AAA ACC CT-3' (SEQ ID NO. 1 09); 
27-mer^ (complementary, 8343 Da): 5'-GTA AAA CGA CGG CCAGTGTGTACG CAA-3' (SEQ ID NO. 110); 27-mern^ 
(non -complementary, 8293 Da): 5'-TACTGG AAG GCG ATCTCAGCA ATC AGC-3' (SEQ ID NO. 111). lOO^tM stodk 
soiutiona were diluted to 20, 10, 5, and 2.5 using ISMohnn/cm H2O. 2^L each of equimolar solutions of the 50-mer 
and either 27-merc or 27-mernc were mixed and allowed to anneal at room temperature for 1 0 minutes. 0.5p.L of these 
mixtures were mixed directly on a sample target with 1 ^iL matrix (0.7 M 3-HPA, 0.07 M ammonium citrate in 50% 
acetonitrile) and allowed to air dry. 

Biological DNA. Enzymatic digestion of human genomic DNA from leukocytes was performed. PGR primers (forward, 
5'-GGC ACG GCT GTC CAA GGA G-3' (SEQ ID NO. 112)); reverse, 6'-AGG CCG CGC TCG GCG CCC TC-3' (SEQ 
ID NO. 113) to amplify a pori:ion of exon 4 of the apo lipoprotein E gene were delineated from the published sequence 
(Das et ai., (1985) J. Bioi Chem,, 260 6240). Taq polymerase and lOx buffer were purchased from Boehringer-Man- 
nheim (Gemnany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 ^lI including 20 
pmol of each primer and 1 0% DMSO (dimethyisulfoxide, Sfgma) with approximately 200 ng of genomic DNA used as 
template. Solutions were heated to 80*C before the addition of lU polymerase; PGR conditions were: 2 min at 94*C, 
followed by 40 cycles of 30 sec at 94'C, 45 sec at 63*C, 30 sec at 72*C, and a final extension time of 2 mm at 72''c! 
While no quantitative data was collected to determine the final yield of amplified product, it is estimated that -2pmol 
were available for the enzymatic digestion. 

[0506] Cfol and Rsal and reaction buffer L were purchased from Boehringer-Mannheim. 20^1 of amplified products 
were diluted with 1 5^1 water and 4^1 buffer L; after addition of 1 0 units of restriction enzymes the samples were incubated 
for 60 min at 37*C. For precipitation of digest products S^il of 3M ammonium acetate (pH 6.5). (5|^l glycogen (Braun. 
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etaf. (1997) Clin. Chem. 43:1151) (lOmg/ml, Sigma), and IIO^jlI absolute eth an o I were added to 50^L of the analyte 
solutions and stored for 1 hour at room temperature. After at 1 0 min centrifugation at 13,000 X g, the pellet was washed 
in 70% ethanol and resuspended in 18Mohm/cm H2O. 

[0507] Sampte preparation andanafysis by MALDf-TOF MS, 0.35[il of resuspended DNA was mixed with 0.35-1 .3 
I^L matrix solution (0.7M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1;1 H20;CH3CN) (Wu, et al. 
(1 993) Rapid Commun. Mass Spectrom. 7:142) on a stainless steel sample target disk and allowed to air dry preceding 
spectrum acquisition using a Thermo Bioanaiysis Vision 2000 MALDI-TOF instrument operated In pitive ion reflectron 
mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (M^ 
(cafe)) of the fragments were calculated from atomic compositions; the mass of a proton (1 .06 Da) was subtracted from 
raw data values in reporting experimental moJecular masses (M^texp)) as neutral basis. External calibration generated 
from eight peaks (2000-18000 Da) was used for all spectra. 

Results and Discussion 

[0508J Figure 96A is a MALDI-TOF mass spectrum of a mixture of the synthetic 50-mer with (non-complementary) 
27*merji^ (each 1 0 i^M, the highest final concentration used in this study); the laser power was adjusted to just above 
the threshold in-adlation for ionization. The peaks at 8.30 and 15.34 kDa represent singly charged ions derived from 
the 27- and 50-mer single strands, respectively. Poorly resolved low intensity signals at -1 6.6 and -30.7 kDa represent 
homodimers of 27-and 50-mer, respectively; that at 23.6 kOa is consistent with a heterodimer containing one 27-mer 
and one 50-mer strand. Thus low intensity dimer Ions representing all possible combinations from the two non-com- 
plementary oligonucleotides (27-1-27; 27+50; 50+50) were observed. Increasing the irradiance even to a point where 
depurination peaks dominated the spectrum resulted In sifghtly higher intensities of these dimer peaks. Note that the 
hybridization was perfomned at room temperature and with a very low salt concentration, conditions at which non- 
specific hybridization may occur. 

[0509] Figure 96 shows a MALDI-TOF spectrum of the same 50-mer mixed with (complementary) 27-mer^,; the final 
concentration of each oligonucleotide was again 1 0^iM. Using the same laser power as In Figure 96A, fntense signals 
were again observed at 88.34 and 1 5.34 Kda, consistent with single stranded 27- and 50-mer, respectively. Homodimer 
peaks (27 + 27; 50 + 50) were barely apparent in the noise; however, sfngly (23.68 Kda) and doubly ( 1 1 .B4k Da) charged 
heterodfmer {27 + 50) peaks were dominant. Afthough the 23,68 Kcfa dimer peak could be detected from all irradiated 
positions, its intensity reiattve to the monomer peaks varied slightly from spot-to-spot, Repeating the experiment with 
individuai oiigonucfeotide concentrations of 5, 2.5, and 1.25 ^M resulted in decreasing amounts of the 27-/50-mer 
Watson-Crick dimer peak relative to the 27- and 50-mer single stranded peaks. At the lowest concentrations, the ob- 
servation of dimer was "crystal-dependent", that is, irradiation of some crystals produced significant 27-/50-mer dimer 
signal, while other crystals reproducibly yielded very little or none. This indicates that the tncorporation of dsdna Into 
the matrix crystals or the effectiveness of retaining this interaction through the ionlzalion/desorption process is depend- 
ent upon the microscopic properties of the crystals, and/or that there exist steep concentration gradients of the duplex 
throughout the sample. 

[0510] Thus the Figure 96 spectra provide strong evidence that specific WC base pafred dsdna can be observed 
using gentle laser conditions with high concentrations of oligonucleotides in this mass range, the first report of this 
□sing a 3-HPA matrix. The study was extended to a complex mixture of dsdna derived from an enzymatic digest (Rsal/ 
Cfol) of a region of axon 4 of theapoiipoprotein E gene (Das et. al., (1985) J. Bioi Chem., 260 6240); expected fragment 
masses are given in Table iX. 


Table IX 


Cfol/Rsal Digestion Products from ApoE gene exon 4^ 

bases^ 

ssDNA 

(Da) 

dsdna (Da) 

(+) (-) 

(+) 

(-) 


11 13 

3428 

4025 

7453 

16 

5004 

4924 

9928 

18 

5412 

5750 

11162 

17 19 

5283 

5880 

11163 

19 

5999 

57S1 

11780 


»E3 allele has no 17/19 or t9/19 pairs; e4 allele contains nq 36/38 pair. 
'(+) sense strand, (-) antlsense strand 
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Table IX (continued) 


Cfol/RsaJ Digestion Products from ApoE gene exon 4^ 

bases*^ 

ssDNA 

(Da) 

dsdna (Da) 

24 

22 

7510 

6745 

14225 

31 

29 

9628 

9185 

18813 

36 

38 

11279 

11627 

22906 

48 

14845 

14S58 

29703 

55 

53 

17175 

16240 

33415 


*e3 allele has no 17/19 or 19/19 pairs; e4 allele contains no 3B/3a pair. 
^(+) sertse strand, (-) antlsense strand 


After the digestion step, the sannples were purified and concentrated by ethanol precipitation and resuspended In 1^JLL 
HjO before mixing them at room temperature with matrix on the sampJe target. Nearly 20 peaks ranging in mass from 
3.4-1 7.2 Kda were resofved In the products' MALD! spectrunn (Figure 97A), all consistent with denatured single stranded 
components of the double strand (Tabfe IX). Many such analyses of similar biological products over a period of months 
also yielded spectra with negligible dsdna, consistent with previous reports (Tang, etaL (1 994) Rapid Commun, Mass 
Spectrom, 8:183; Liu, etal. (1995) Anal. Chem. 67:3482; Siegert etaL (1996) Anal. Biochem. 243 :55; and Doktycz, et 
af. (1995) Anal. Biochem. 230:205); contrarily, intact double strands were observed under similar conditions for the 
synthetic DNA (Figure 96A). It is difficult to estimate the strand concentration available after the biological reactions, 
but presunnably that ft was far lower than that at which dimerization of synthetic samples occurred. Furthermore, main- 
taining speciffc hybrids within the two-component synthetic mi)fture may be kinetically favored relative to the far more 
complex mixture of 20 single-stranded DNA components from the digest. 

[0511] The effect of reduced temperature on maintaining dsDMA was tested. An aliquot of the digested DNA solution, 
the matrix, pipette, pipette tips, and the stainless steel sample target were stored in a 4*C "cold room" for 15 minutes; 
as with normal preparations matrix, and then analyte, were spotted on the target and allowed to co-crystallize while 
air drying. Crystallization for mixtures of 300 nL 3HPA (50% acetonitrrle) with 300 nL analyte required - 1 minute at 
room temperature but -1 5 minutes at the reduced temperature. Sample spots prepared in the cold room environment 
typically contained a high proportion of large transparent crystals. 

[0512] MALDI-TOF analysis of an ApoE digest aliquot prepared at reduced temperature produced the Figure 97B 
spectrum. While the low mass range appeared qualitatively similar to Figure 97A, dramatic differences above 8 kDa 
were observed. Only signals consistent with single strands (Table IX) were observed in Figure 97A, but the Figure 97B 
cold room prepared samples did not yield signals for the same masses except below 8 kDa. Even more striking were 
the additional high mass peaks In Figure 97B; clearly these represent dimer peaks containing lower mass components. 
As was done with the synthetic DNA, it was important to determine whether these represent non-specific heterodimers, 
specific WC heterodimers, or nonspecific homodimers. Consider first the 33.35 kDa fragment. Ignoring the unlikely 
possibility that the high mass fragment represents a trimer or higher multimer, as a dimer it must only contain the 
highest mass ssDNA components, Le., the >16 kDa. Homodimerization of the 15.24 and 17.18 kDa fragments would 
result in 32.49 and 34.35 kDa peaks, respectively; corresponding mass errors for these incorrect assignments relative 
to the observed 33.35 kDa would be -2.6% and +3.0% respectively. A far better match is achieved if this peak originates 
from a heterodimer of the two highest mass single stranded fragments; their summed mass (1 6,24-t-1 7.1 8 = 33.42 kDa) 
differed by 0.2% from the observed dimer mass 33.35 kDa, an acceptable mass error for MALDI-TOF analysis of large 
DNA fragments using external calibration. Likewise, the 29.B6kDa fragment was measured only 0.13% lower than the 
29.70 Da expected for a heterodimer of 48-mers; the sum of no other possible homodimers or heterodimers were within 
a reasonable range of this mass. Similar arguments could be madefor the 22.89 and 1 8.83 kDa fragments, representing 
36-/38-mer and 31 -/29-heterodimers, respectively; the signal at 14.86 kDa is consistent with singly charged single 
stranded and doubly charged double-stranded 4B-mer. The agreement of the Figure 978 masses above 15 kDa with 
the of dsDNA expected from this digest and the absence of homodimers and non-specific heterodimer at random 
masses indicated that the base pairings were indeed highly specific and provided further evidence that gas-phase WC 
interactions may be retained in MALDI-generated ions. 

[0513] Figure 98 shows a MALDI-TOF spectrum of an G4 allele, which, unlike the G3, was expected to yield no 36-/ 
38-mer pair upon Cfol/Rsal digestion. The G3 and G4 mass spectra were similar except that abundant 22.89 kDa 
fragment in Figure 97B was not present in Figure 98; with this infomnation alone (Table IX) G3 and G4 alleles were 
easily distinguished, thereby demonstrating the genotyping by direct measurement of dsDNA by MALDI-TOF MS. 
Similarly dsDNA could be ionized, transferred to the gas phase, and detected by MALDI-TOF MS. The acceleration 
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voJtage typically employed on our instrument was only - 5kV corresponding to 1 .5kV/mm up to -2 mm from the sample 
target, with the electric field strength decreasing rapidly with distance from the sample target. Most previous work used 
at least 20kV acceleration (Lecchi etai, (1 995) J. Am. Soc. Mass Spectrom. 6:972); in one exception a 27-mer dsDNA 
was detected using a frozen matrix solution and 1 00 V acceleration (Nelson, etaL (1 990) Rapid Commun. Mass Spec- 
trom. 4:348). Without being bound by any theory MALDI-induced "denatu ration" of dsDN A may be due to gas-phase 
collisional activation that disrupts the WC pairing when high acceleration fields are employed, analogous to the dena- 
turation presumed to be a first step in the fragmentation used for sequencing the single stranded components of dsDNA 
using electrospray ionization (McLafferty etai. (1 996) Int. J. Mass Spectrom., Ion Processes ), It appears that the high 
sail concentrations (typically > lOmM NaCI or KC() required to stabilize WC paired dsDNA in solution are unsuitabie 
for MALDi analysis (Nordhoff et aL (1 993) ISfucleic Acids Res. 21 :3347); reducing the concentration of such non-volatile 
cations Is necessary to avoid catlon-adducted MALDI signals, but destabilizes the double strands in solution. The low 
pH conditions of the matrix environment should also destabilize the duplex. As shown In Figures 97B and 96, storing 
and preparing even low concentrations of the biological samples at reduced temperature at least in part offset these 
denaturing effects, especially for longer strands where melting temperatures are hfgher due to a more extensive hy- 
drogen bonding network. The conditions used here are recognized to be very non-stringent annealing conditions. 
[0514J The low mass tails on hrgh mass dsDNA peaks (e^L, Figure 97B, 232 kDa) are consistent with depurination 
generated to a higher extent than the sum of depurination from each of the single strands combined. Although depu- 
rination in solution is an acid-catalyzed reaction, the weakly acidic conditions in the 3-HPA matrix do not Induce sig- 
nificant depurination; molecular Ion signals from a mixed-base 50-mer measured with De-MALDI-TOF had only minor 
contributions from depurination peaks (Juhaz, etat. (1996) AnaLChem. 68:941). Depunnation from the single stranded 
components of the gas-phase dsDNA fs observed even though these bases are expected to be hydrogen bonded to 
the complementary base of the accompanying strand, implying that covalent bonds are being broken before the strand 
Is denatured. 

EXAMPLE 27 

Efficiency and Specificity Assay for Base-Specific Ribonucfeases 

[0515] Aliquots sampled at regular time intervals during digestion of selected synthetic 20 to 25 mers were analyzed 
by mass spectrometry. Three of the RNAses were found to be efficient and specific. These include: the G-specific T^, 
the A-specific and the A/U-specific PhyM. The ribonucleases presumed to be C-specific were found to be less 
reiiabfe, e^, did not cfeave at every C or also cleaved at U in an unpredictable manner. The three promising RNAses 
all yielded cleavage at a/i of the predicted positions and a complete sequence coverage was obtained. In addition, the 
presence of cleavage products containing one or several uncteaved positions (short incubation times), aiiowed align- 
ment of the cleavage products. An example of the MALDi -spectrum of an aliquot sampled after T^ digest of a synthetic 
20'mer[SEQ ID NO:114] RNA Is shown in Figure 100. 

EXAMPLE 28 

Immobilization of amplified DNA targets to silicon waters 
Silicon surface preparation 

1051 6] Silicon wafers were washed with ethanol, flamed over bunsen burner, and immersed In an anhydrous solution 
of 25% (by volume) S-amtnopropyitriethoxysltane in toluene for 3 hours. The silane soiution was then removed, and 
the wafers were washed three times with toluene and three times with dimethyl sulfoxide (DMSO). The wafers were 
then incubated in a 1 0mM anhydrous solution of N-succinimidyl {4-iodoacetyl) aminobenzoate (SEAB) (Pierce Chemical, 
Rockford, IL) In anhydrous DMSO. Followingthereaction, the SIAB solution was removed, andthewafers were washed 
three times with DMSO. In all cases, the iodoacetamido-functionallzed wafers were used immediately to minimize 
hydrolysis of the labile iodoacetamido-functionailty. Additionally, all further wafer manipulations were performed in the 
dark since the iodoacetamido-f unction ality is light sensitive. 

Immobilization of amplified thiol-contalnlng nucleic acids 

[0517] The SIAB-conjugated silicon wafers were used to analyze specific free thiof-contalning DNA fragments of a 
particuiar amplified DNA target sequence. A 23-mer oligodeoxynucfeotide containing a S'-disulfide linkage [purchased 
from Operon Technologies; SEQ ID NO: 117] that is complementary to the 3'-reglon of a 112 bp human genomic DNA 
template [Genebank Acc. No.: Z52259; SEQ fD NO: 118] was used as a primer in conjunction with a commercially 
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availabJe 49-nner primer, which is complementary to a portion of the 5'-end of the genomic DN A [purchased from Operon 
Technologies; SEQ ID NO: 119], in PGR reactions to amplify a 135 bp DNA product containing a 5'-dlsulflde linkage 
attaclied to only one strand of the DNA duplex [SEQ ID NO: 120]. 

[0518] The PGR amplification reactions were perfonned using the Amplitaq GoidKit [Perkin Elmer Catalog No. 
N808-0249]. Briefly. 200 ng 112 bp human genomic DNA template was incubated with 10 |iM of 23-mer primer and 
B\lM of commercially available 49-mer primer, 10 mM dNTPs^ 1 unit of Amplitaq Gold DNA polymerase in tlie buffer 
provided by the manufacturer and PGR was performed in a themnocycler. 

[0519] The 5'-disulfide bond of the resulting amplified product was fully reduced using 1 0 mM tris-(2-cart)oxyethyf) 
phosphine(TCEP) (Pierce Chemical, Rockford, IL) to generate a free S'-thfol group. Disulfide reduction of the modified 
oligonucleotide was monitored by observing a shift in retention time on reverse-phase FPLC. It was detennined that 
after five hours in the presence of 1 0 mM TCEP, the disulfide was fulJy reduced to a free tJiiol. Immediately following 
disulfide deavage. the modified oligonucleotide was incubated with the iodacetamido-functionalized wafers and con- 
jugated to the surface of the silicon waferthrougl^ the SIAB linker. To ensure complete thiol deprotonation, the coupling 
reaction was performed at pH 8.0. Using 1 0mM TCEP to cleave the disulfide and the other reaction conditions described 
above, it was possible to reproducibly yield a surface density of 250 fmol per square mm of surface. 

Hybridization and MALDI-TOF Mass spectrometry 

[0520] The silicon wafer conjugated with the 135 bp thiol-containing DNA was incubated with a complementary 
12-mer oligonucleotide [SEQ ID NO; 121] and specifically hybridized DNA fragments were detected using MALDI-TOF 
MS analysis. The mass spectrum revealed a signal with an observed experimental mass-to-charge ratio of 3618.33; 
the theoretical mass-to-charge ratio of the 1 2-mer oligomer sequence is 3622.4 Da. 

[0521] Thus, specific DNA target molecule that contain a S'-disutfide linkage can be amplified. The molecules are 
immobilized at a high density on a SIAB-derivatized silicon wafer using the methods described herein and specific 
complementary oligonucleotides may be hybridized to these target molecules and detected using MALDJ-TOF MS 
analysis. 

EXAMPLE 29 

Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid Arrays 

[0522] Employing the high density attachment procedure described in EXAMPLE 28, an array of DNA oligomers 
amenable to MALDI-TOF mass spectrometry analysis was created on a silicon wafer having a plurality of locations, e. 

depressions or patches, on its surface. To generate the array, a free thioi-containing oligonucleotide primer was 
immobilized only at the selected locations of the wafer [e.g., see EXAMPLE 28]. The each location of the array contained 
one of three different oligomers. To demonstrate that the different immobilized oligomers could be separately detected 
and distinguished, three distinct oligonucleotides of differing lengths that are complementary to one of the three oli- 
gomers were hybridized to the array on the wafer and analyzed by MALDI-TOF mass spectrometry. 

Oligodeoxy nucleotides 

[0523] Three sets of complementary oilgodeoxynucleotide pairs were synthesized in which one member of the com- 
plementary oligonucleotide pair contains a 3'- or 5'-disulflde linkage [purchased from Operon Technologies or ONgos, 
Etc.]. For example, Oligomer 1 [d(CTGATGCGTCGGATCATC I i I I I l-SS); SEQ ID NO: 122] contains a 3*-disulfide 
linkage whereas Oligomer 2 [d{SS-CCTCTTGGGAACTQTGTAGTATT); a 6'-disulfide derivative of SEQ ID NO: 117] 
and Oligomer 3 [d(SS-GAATTCGAGCTCGGTACCCGG); a 5' -disulfide derivative of SEQ ID NO: 115] each contain a 
5'-disulfide linkage. 

[0524] The oligonucleotides complementary to Oligomers 1 -3 were designed to be of different lengths that are easily 
resolvable from one another during MALDI-TOF MS analysis. For exampJe, a 23-mer oligonucieotide [SEQ ID NO: 
123] was synthesized complementary to a portion of Oligomer 1, a 12-mer oligonucieotide [SEQ ID NO: 121] was 
synthesized complementary to a portion of Oligomer 2 and a 21 -mer [SEQ ID NO: 11 6] was synthesized complementary 
to a portion of Oligomer 3. In addftion, a fourth 29-mer oligonucleotide [SEQ ID NO: 124] was synthesized that lacks 
complementarity to any of the three oligomers. This fourth oligonucleotide was used as a negative control. 
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Silicon surface chemistry and DNA immobilization 
(a) 4 X 4 (16-locatlon} array 

[0525] A 2 X 2 cm^ silicon wafer having 256 mdivjclua! depressions or wells in the fonn of a 1 6 X 16 weJI array was 
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas], The wells were 800 X 
600 ^im^, 120 |xm deep, on a 1 .125 pilch. The silicon wafer was reacted with S-aminopropyttriethoxysifane to produce 
a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SI AB resulting 
in iodoacetamido functionalities on the surface [e.g. . see EXAMPLE 28], 

[0526] To prepare the oligomers for coupling to the various locations of the silicon array the disulfide bond of each 
oligomer was fully reduced using 10 mM TCEP as depicted in EXAMPLE 28, and the DNA resuspended at a final 
concentration of 1 0]iM in asolution of 1 00 mM phosphate buffer, pH S.O. I mmediatefy following disulfide bond reduction, 
the free-thiol group of the oligomer was coupFed to the iodoacetamido functionality at 1 6 locations on the wafer using 
the probe coupling conditions essentially as described above in EXAMPLE 28. To accomplish the separate coupling 
at 16 distinct locations of the wafer, the entire surface of the wafer was not flushed with an oligonucleotide solution 
but, instead, an — 30-nl aliquot of a predetermined modified oligomer was added In parallel to each of 16 locations (i. 
e., depressions) of the 256 wells on the wafer to create a 4 x 4 array of immobilized DNA using a robotic pintool. 
[0527] The robotic pintool consists of 16 probes housed in a probe block and mounted on an X Y, Z robotic stage. 
The robotic stage was a gantry system which enables the placement of sample trays below the arms of the robot. The 
gantry unit itself is composed of X and Y arms which move 250 and 400 mm, respectively guided by brushless linear 
servo motors with positional feedback provided by linear optical encoders. A lead screw driven Z axis (50 mm vertrcal 
travel) is mounted to the xy axis slide of the gantry unit and is controlled by an in-line rotary servo motor with positional 
feedback by a motor-mounted rotary optical encoder. The work area of the system is equipped with a slide-out tooling 
plate that holds five microliter plates (most often. 2 plates of wash solutiort and 3 plates of sample for a maximum of 
11 52 different oligonucleotide solutions) and up to ten 20x20 mm wafers, The wafers are placed precisely in the plate 
against two banking pins and held secure by vacuum. The entire system Is enclosed in plexi-glass housing for safety 
and mounted onto a steel support frame for thermal and vibrational damping. Motion control is accomplished by em- 
ploying a commercial motion controller whtch was a 3-axis servo controller and is integrated to a computer; program- 
ming code for specific applications Is written as needed. 

[0528] To create the DNA array, a pintool with assemblies that have solid pin elements was dipped into 16 wells of 
a multi-well DNA source plate containing solutions of Oligomers 1-3 to wet the distal ends of the pins, the robotic 
assembly moves the pin assembly to thesiiicon wafer, and the sample spotted by surface contact. Th us , one of modified 
Oligomers 1 -3 was covalently immobilized to each of 1 6 separate wells of the 256 wells on the silicon wafer thereby 
creating a 4 x 4 array of immobilized DNA, 

[0529] In carrying out the hybridization reaction, the three complementary oligonucleotides and the negative control 
oligonucleotide were mixed at a final concentration of 1 0 |aM for each oligonucleotide in 1 ml of TE buffer [1 0 mM Tris- 
HCf, pH 8.0, 1 mM EDTA] supplemented with 1 M NaCI, and the solution was heated at 65*C for 10 min. immediately 
thereafter, the entire surface of the silicon wafer was flushed with 800 p.1 of the heated oligonucleotide sofution. The 
complementary oligonucleotides were annealed to the immobifized oligomers by incubating the silicon array at ambient 
temperature for 1 hr, followed by incubation at 4^*0 for at least 10 min. Altematively the oligonucleotide solution can 
be added to the wafer which is then heated and allowed to coo! for hybridization. 

[0530] The hybridized array was then washed with a solution of 50 mM ammonium citrate buffer for cation exchange 
to remove sodium and potassium ions on the DNA backbone (Pieles etaL, (1993) Nucl. Acids Res, 21:3191-31961 A 
6-nl aliquot of a matrix solution of 3-hydroxyp(collnic acid [0.7 M 3-hydroxypicolinic acid-10 % ammonium cftrate in 50 
% acetonitrile; see Wu et at. Rapid Commun. Mass Spectrom. 7: 1 42-1 46 (1 993)] was added in series to each location 
of the array using a robotic piezoelectric serial dispenser (Le., a piezoelectric pipette system). 
[0531] The piezoelectric pipette system is built on a system purchased from Microdrop GmbH, Norderstedt Germany 
and contains a piezoelectric element driver which sends a pulsed signal to a piezoelectric element bonded to and 
surrounding a glass capillary which holds the solution to be dispensed; a pressure transducer to load (by negative 
pressure) or empty (by positive pressure) the capillary; a robotic xyz stage and robot driver to maneuver the capillary 
for loading, unloading, dispensing, and cleaning, a stroboscope and driver pulsed at the frequency of the piezo element 
to enable viewing of 'suspended' droplet characteristics; separate stages for source and designation plates or sample 
targets (i.e. Si chip); a camera mounted to the robotic ami to view loading to designation plate; and a data station which 
controls the pressure unit, xyz robot, and piezoelectric driver 

[0532] The 3-HPA solution was allowed to dry at ambient temperature and thereafter a 6-nl aliquot of water was 
added to each location using the piezoelectric pipette to resuspend the dried matrix-DNA complex, such that upon 
drying at ambient temperature the matrix-DNA complex forms a uniform crystalline surface on the bottom surface of 
each location. 
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MALDI-TOF MS analysis 

[0533] The MALDI-TOF MS analysis was performed in series on each of tine 16 iocalions of the hybridization array 
iilustrated in Figure 6 essentialiy as described in EXAIVIPLE 28. The resuiting mass spectrum of oligonucieotides that 
speclficaliy hybridized to each of the 1 6 locations of the DNA hybridization revealed a specific signal at each location 
representative of observed experimental nnass-to-charge ratio corresponding to the specific complementary nucleotide 
sequence. 

[0534] For example, in the locations that have only Oligomer 1 conjugated thereto, the mass spectrum revealed a 
predominate signaf with an observed experimental mass-to-charge ratio of 7072,4 approximately equal to that of the 
23-mer; the theoretical mass-to-charge ratio of the 23-mer is 7072.6 Da. Similarly, specific hybridization of the 1 2-mer 
oligonucleotide to the array, observed experimental mass-to-charge ratio of 3618.33 Da (theoretical 3622.4 Da), was 
detected only at those locations conjugated with Oligomer 2 whereas specific hybridization of MJM6 (observed exper- 
imental mass-to-charge ratio of 641 5.4) was detected only at those locations of the array conjugated with Oligomer 3 
[theoretical 6407.2 Da]. 

[0535] None of the locations of the array revealed a signal that corresponds to the negative control 29-mer oligonu- 
cleotide (theoretical mass-to-charge ratio of 8974.8) indicating that specific target DNA molecules can be hybridized 
to oligomers covalently Immobilized to specific locations on the surface of the silicon array and a plurality of hybridization 
assays may be individually monitored using MALDI-TOF MS analysis. 

(b) 6 X 8 (64-location) array 

[0536] A 2 X 2 cm^ silicon wafer having 256 individual depressions or wells that form a 1 6 X 1 6 array of wells was 
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas]. The wells were 800 X 
800 |im2, 120 jim deep, on a 1.125 pitch. The silicon wafer was reacted with 3-amlnopropyltrfethoxysflane to produce 
a uniform iayer of primary amines on the surface and then exposed to the heterobifunctfonal crosslinker SfAB resulting 
in iodoacetamido functionalities on the surface as described above. 

[0537] To make an array of 64 elements, a prntool was used following the procedures described above, The pmtool 
was dipped into 16 wells of a 384 well DNA source plate containing solutions of Oligomers 1-3, moved to the silicon 
wafer, and the sample spotted by surface contact. Next, the tool was dipped in washing solution, then dipped into the 
same 1 6 wells of the source plate, and spotted onto the target 2.25mm offset from the initial set of 16 spots; the entire 
cycle was repeated to malte a 2x2 array from each pin to produce an 8x8 array of spots (2x2 elements/pin X 16 pins 
= 64 total elements spotted). 

[0538] Oligomers 1 -3 immobilized to the 64 locations were hybridized to complementary olfgonucleotides and ana- 
lyzed by MALDI-TOF MS analysis. As observed for the 16-location array, specific hybridization of the complementary 
oligonucleotide to each of the immobilized thiol-containing oligomers was observed in each of the locations of the DNA 
array. 

EXAMPLE 30 

Extension of hybridized DNA primers bound to ON A templates immobilized on a silicon wafer 

[0539] The SIAB-derivatized silicon wafers can also be employed for primer extension reactions of the immobilized 
DNA template using the procedures essentialty described in EXAMPLE 7. 

[0540] A 27-mer oligonucleotide [SEQ ID NO: 1 25] containing a S'^free thiol group was coupled to a SIAB-derivatized 
silicon wafer as described above, for example, in EXAMPLE 28. A 12-mer oJigo nucleotide primer [SEQ ID NO: 1261 
was hybridized to the immobilized oligonucleotide and the primer was extended using a commercially available kit [e, 
g., Sequenase or ThermoSequenase, U.S. Biochemical Corp]. The addition of Sequenase DNA polymerase orTher- 
moSequenase DNA polymerase in the presence of three deoxyribo nucleoside triphosphates (dNTPs; dATP, dGTP, 
dCTP) and dideoxyribonucleoside thymidine triphosphate (ddTTP) in buffer according to the instmctlons provided by 
the manufacturer resulted in a 3-base extension of the 12-mer primer while still bound to the silicon wafer. The wafer 
was then analyzed by MALDI-TOF mass spectrometry as described above. The mass spectrum results clearly distin- 
guish the 15-mer [SEQ ID NO: 127] from the original unextended 12-merthus indicating that specific extension can 
be perfonned on the surface of a silicon wafer and detected using MALDI-TOF MS analysis. 
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EXAMPLE 31 

Effect of linker length on polymerase extension of hybridized DNA primers bound to DNA templates 
immobUized on a silicon wafer 

[0541] Tiie effect of the distance between the SIAB-conjugated silicon surface and tlie duplex DNA formed by hy- 
bridization of the target DNA to the immobiJized oligomer template was investigated, as well as choice of enzyme. 
[0542] TwoSIAB-derlvatized silicon wafers were conjugated to the 3' -end oftwofreethioj-containing oligonucieotides 
of identical DNA sequence except for a 3-base poly dT spacer sequence incorporated at the 3'-end: 

CTGATGCGTC QQ ATCATCTT TTTT SEQ ID No. 122 

CTGATGCGTC GGATCATCTT Ti l I I IT SEQ ID No. 125. 

These oligonucleotides were synthesized and each was separately innmobiiized to the surface of a silicon wafer through 
the SIAB cross-linker [ e.g. , see EXAMPLE 28]. Each wafer was incubated with a 12-mer oligonucleotide: 

AAAAAAGATG AT SEQ ID No. 126 

GATGATCCGA CG SEQ ID No- 128 

GATCCGACGC AT SEQ ID No, 129, 

which is complementary to portions of the nucleotide sequences common to both of the oligonucleotides, by denaturing 
at 75 *C and slow cooling the silicon wafer. The wafers were then analyzed by MALDI-TOF mass spectrometry as 
described above. 

[0543] As described in EXAMPLE 30 above, a 3-base specific extension of the bound 1 2-mer oligonucleotide was 
observed using the oligomer primer where there is a 9-base spacer between the duplex and the surface [SEQ ID NO; 
1 25]. Similar resutts were observed when the DNA spacer lengths between the SIAB moiety and the DNA duplex were 
0, 3, 6 and 12. In addition, the extension reaction may be perfomned using a variety of DNA polymerases, such as 
Sequenase and Thermo Sequenase (US Biochemical). Thus, the SIAB linker may be directly coupled to the DNA 
template or may include a linker sequence without effecting primer extension of the hybridized DNA. 

EXAMPLE 32 

Spectrochip mutant detection Jn ApoE gene 

[0544] This example describes the hybridization of an immobilized template, primer extension and mass spectrometry 
for detection of the wildtype and mutant Apoiipoprotein E gene for diagnostic purposes. This example demonstrates 
that immobilized DNA molecules containing a specific sequence can be detected and distinguished using primer ex- 
tension of unlabeled allele specific primers and analysis of the extension products using mass spectrometry. 
[0545] A 50 base synthetic DNA template complementary to the coding sequence of allele 3 of the wildtype apoii- 
poprotein E gene: 

5'- GCCTGGTACACTGCCAGGCGCTTCTGCAGGTCATCGGCATCGCGGAGGAG -3' 
(SEQ ID NO: 280] 

or complement to the mutant apoiipoprotein E gene canning a G -*A transition at codon 158: 

5''GCCTGGTACACTGCCAGGCACTTCTGCAGGTCATCGGCATCGCGGAGQAQ-3' 
[SEQ ID NO: 281] 
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containing a 3'-free thiol group was coupled to separate SiAB-derivatized silicon wafers as described in Example 28. 
[0546] A 21 -mer oligonucleotide primer: 

5'-GAT GCC GAT GAC CTG CAG AAG-3' [SEQ ID NO: 282] was hybridized to each of the immobilized templates and 
the primer was extended using a commercially availabJe Icit [e.g., Sequenase orThennosequenase, U.S. Biochemical 
Corp], The addition of Sequenase DNA polymerase or Therm osequenase DNA polymerase in the presence of three 
deoxyribonucieoside triphosphates (dNTPs; dATP, dGTP, dTTP) and dideoxyribonucJeoside cytosine triphosphate (dd- 
CTP) in buffer according to the instructions provided by the manufacturer resulted in a single base extension of the 
21 -mer primer bound to the immobilized template encoding the wildtype apolipoprotein E gene and a three base ex- 
tension of the 21 -mer primer bound to the immobilized template encoding the mutant form of apolipoprotein E gene. 
[0547] The wafers were analyzed by mass spectrometry as described herein. The wiJdtype apolipoprotein E se- 
quence results in a mass spectrum that distinguishes the primer with a single base extension (22-mer) with a mass to 
charge ratio of 6771 .1 7 Da (the theoretical mass to charge ratio is 6753.5 Da) from the original 21 -mer primer with a 
mass to charge ration of 6499.64 Da. The mutant apolipoprotein E sequence results in a mass spectrum that distin- 
guishes the primer with a three base extension (24-mer) with a mass to charge ratio of 7386.9 (the theoretical mass 
charge is 7386.9) from the origlnaJ 21 -mer primer with a mass to charge ration of 6499.64 Da. 

EXAMPLE 33 

Detection of Oouble-Stranded Nucleic Acid Molecules via Strand Displacement and HybridrzatJon to an 
Immobilized CompJementary Nucleic Acid 

[0548] This example describes immobilization of a 24-mer primer and the specific hybridization of one strand of a 
duplex DNA molecule, thereby permitting amplication of a selected target molecule fn solution phase and permitting 
detection of the double stranded molecule. This method is useful for detecting single base changes, and, particularly 
for screening genomic libraries of double-stranded fragments. 

[0549] A 24-mer DNA primer CTGATGCGTC GGATCATCTT TTTT SEQ JD No. 1 22, containing a 3'-free thiol group 
was coupled to a SIAB -derivatized silicon wafer as described in Example 29. 
[0550] An 18-mer synthetic oligonucleotide: 

5'-CTGATGCGTCGGATCATC-3' [SEQ ID NO: 286] was premixed with a 1 2-mer 5'-GATGATCCGACG-3' [SEQ ID NO: 
285] that has a sequence that is complementary to 12 base portion of the 1 8-mer ollgonucieotide. The oligonucleotide 
mix was heated to 75*C and cooled slowly to room temperature to faciliate the formation of a duplex molecule: 

S' ^CTGATGCGTCGGATCATC-3' [SEQ ID NO. 286] 
3' - GCAGCCTAGTAG-5' [SEQ ID NO: 287]. 

[0551] The specific hybridization of the 12-mer strand of the dupiex molecule to the immobilized 24-mer primer was 
carried out by mixing 1]iM of the duplex molecule using the hybridization conditions described in Example 30. 
[0552] The wafers were analyzed by mass spectrometry as described above. Specific hybridization was detected in 
a mass spectrum of the 1 2-mer with a mass to charge ratio of 3682.78 Da. 

EXAMPLE 34 

1 -{2-Nitro-5-(3-0-4,4'-diniethoxytritylpropoxy)phenyl)-1-0-((2-cyarioethoxy)-dJisopropylamlnopho 
ethane 

A. 2-Nltro-5-(3-hydroxypropoxy)benzaldehyde 

[0553] 3-Bromo-1-propanol (3.34 g, 24 mmoJ) was refluxed in 80 ml of anhydrous acetonitrile with 5-hydroxy-2-nj- 
trobenzaldehyde (3.34 g, 20 mmol), K^COa (3.5 g). and Kl (100 mg) overnight (15 h). The reaction mixture was cooled 
to room temperature and 150 ml of methylene chloride was added. The mixture was filtered and the solid residue was 
washed with methylene chlonde. The combined organic solution was evaporated to dryness and redissoived in 100 
ml methyiene chloride. The resulted solution was washed with saturated NaCI solution and dried over sodium sulfate. 
4.31 g (96%) of desired product was obtained after removal of the solvent in vacuo. 
R| - 0.33 (dichloromethane/methanol, 95/5). 

UV (methanol) maximum: 313, 240 (shoulder), 215 nm; minimum: 26B nm. 

NMR (DMSO-de) 6 1 0.28 (s. 1 H), 8.17 (d. 1 H), 7-35 (d, 1 H), 7.22 (s, 1 H), 4.22(t. 2H), 3.54 (t, 2H), 1 ,90 (m, 2H). 
NMR (DMSO-dfi) 5189.9, 153.0, 141.6, 134.3, 127.3, 11fi.4, 114.0, 66.2, 56.9, 31,7. 


92 


EP 1 164 203 A2 


B . 2- N itro-5-(3-0-t-bu ty Idimethy I s i ly Ip ropoxy)benzaldehyde 

[0554] 2-Nitfo-5-(3-hydroxypropoxy)benzaldehyde(1 g, 4.44 mmol) was dissolved in 50 ml anhydrous acetonitrile. 
To this solution, it was added 1 ml of triethylamine. 200 mg of imidazole, and 0.8 g (5.3 mmol) of tBDMSCL The mixture 
was stirred at room temperature for 4 h. Methanol (1 ml) was added to stop the reaction. The solvent was removed in 
vacuo and the solid residue was redissolved in 100 ml methylene chloride. The resulted solution was washed with 
saturated sodium bicarbonate solution and then water. The organic phase was dried over sodium sulfate and the 
solvent was removed in vacuo. The crude mixture was subjected to a quick silica gel column with methylene chloride 
to yield 1 .44 g (96%) of 2-nitro-5-(3-0-t-butyldimethyJsilylpropoxy)ben2aldehyde. 
R( = 0.67 (hexane/ethyl acetate, 5/1). 

UV (methanol), maximum: 317, 243, 215 nm; minimum: 235, 267 nm. NMR (DMSO-dg) 5 10.28 (s, 1H), 8.14 (d, 
1 H), 7.32 (d, 1 H), 7,20 (s, 1 H), 4.20 (t. 2H). 3.75 (t, 2H), 1 .90 (m, 2H), 0.85 (s, 9H), 0.02 (s, 6H). NMR (DMSO-dg) 
5 189.6, 162.7, 141.5, 134.0, 127.1, 118.2, 113.8, 65.4, 58.5, 31.2, 25,5, -3.1, -5.7. 

C. 1 -(2-Nltro-5-(3-0-Nbutyidimethylsilylpropoxy)phenyl)ethanol 

[0555] High vacuum dried 2-nitro-5-(3-0-t-butyldimethylsilylpropoxy)benzaldehyde (1 .02 g, 3 mmol) was dissolved 
50 ml of anhydrous methylene chloride, 2 M Trimethylaluminium in toluene (3 ml) was added dropwise within 10 min 
and l^eeped the reaction mixture at room temperature. It was stirred further for 1 0 min and the mixture was poured into 
10 ml ice cooled water. The emulsion was separated from water phase and dried over 100 g of sodium sulfate to 
remove the remaining water. The solvent was removed in vacuo and the mixture was applied to a silica gel column 
with gradient methanol in methylene chloride, 0.94 g (86%) of desired product was isolated. 
Rf ^ 0.375 (hexane/ethyl acetate, 5/1). 

UV (methanol), maximum: 306,233, 206 nm; minimum: 255, 220 nm. ""H NMR (DMSO-dg) 6 8.00 (d, 1H). 7.36 (s, 1H), 
7.00 (d. 1 H), 5.49 (b, OH), 5.31 (q, 1 H), 4. 19 (m. 2H), 3.77 (t, 2H), 1 .95 (m, 2H), 1 .37 (d, 3H), 0.86 (s, 9H), 0.04 (s, 6H). 
■"^C NMR (DMSO-dg) S 162.6, 146.2, 139.6, 126.9, 112.9, 112.5, 64.8, 63.9, 58.7, 31.5, 25,6, 24.9, -3.4, -5.8. 

D . 1 -(2-Nlt ro-5-(3< hydroxy p ropoxy)pheny l)eth a no I 

[0556] 1 -{2-Nitro-5-(3-0-t-butyldimethylsilylpropoxy)phenyl)ethanol (0.89 g, 2.5 mmol) was dissolved in 30 ml of THF 
and 0.5 mmol of nBu^NF was added under stirring. The mixture was stirred at room temperature for 5 h and the solvent 
was removed In vacuo. The remaining residue was applied to a silica gel column with gradient methanol in methylene 
chloride. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanoi (0.6 g (99%) was obtained. 
Rf=0.17 (dichloromethane/methanol, 95/5). 

UV (methanol), maximum: 304, 232, 21 0 nm; mmimum: 255, 21 9 nm, iH NMR (DMSO-dg) 6 B.OO (d, 1 H), 7.33 (s, 1 H), 
7.00 (d, 1 H), 5.50 (d, OH), 5.28 (t, OH), 4.59 (t, 1 H), 4.1 7 (t, 2H), 3.57 (m, 2H), 1 .89 (m, 2H), 1 .36 (d, 2H). 
^^C NMR (DMOS-dg) 6 162.8, 146.3. 139.7, 127.1 , 113.1, 112.6, 65.5, 64.0, 57.0, 31.8, 25.0. 

E. 1-(2-N]tro-5-(3-0-4,4'-dimethoxytritylpropoxy}phenyl)ethanol 

[0557] 1 -(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.482 g, 2 mmol) was co-evaporated with anhydrous pyridine 
twice and dissolved in 20 ml anhydrous pyridine. The solution was cooled in ice-water bath and 750 mg (2.2 mmol) of 
DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml methanol was added 
to stop the reaction. The solvent was removed In vacuo and the residue was co-evaporated with toluene twice to 
remove trace of pyridine. The final residue was applied to a silica gel column with gradient methanol In methylene 
chloride containing drops of triethylamine to yield 0.96 g (89%) of the desired product 1 -(2-nftro-5-(3-0-4p4'-dimethox- 
ytrityl-propoxy)phenyl)ethanoL 
R,=0,50 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 350 (shoulder), 305, 283, 276 (shoulder), 233, 208 nm; minimum: 290, 258, 220 nm. 

^H NMR (DMSO-dg) 6 B.OO (d, 1H), 6.82-7.42 (ArH), 5.52 (d, OH), 5.32 (m, 1H), 4.23 (t, 2H), 3.71 (s, 6H), 3.17 (t, 2H), 

2.00 (m, 2H), 1.37 (d, 3H). 

13C NMR (DMOS-dg) 5 162,5, 157.9, 157.7, 146.1. 144.9, 140.1, 139.7, 135.7, 129.5, 128.8, 127.6, 127.5, 127.3, 
126 9, 126.4, 113.0, 112.8, 112.6, 85.2, 65.3, 63.9, 59.0, 54.8, 28.9, 24.9. 

F 1-(2-Njtro-5'{3-0-4,4-dlmethoxytrityJpropoxy)phenyl)-1-0 ((2-cyanoethoxy)-dii»opropyiaminophosp 
ethane 

[0558] 1 -(2-Nitro-5-(3-0-4,4'-dimethoxytritylpropoxy)phenyl)ethanol (400 mg. 0.74 mmol) was dried under high vac- 
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uum and was dissolved in 20 mi of aninydrous methyJene chioride. To this soiution, it was added 0.5 mi N.N-diisopro- 
pyletliylamine and 0.3 mi (1 .34 mmol) of 2-cyanoetliyi-N,N-diisopropyiclnioropliosphoramidite. The reaction mixture 
was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture was 
washed wfth saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed in 
vacuo and a qu(cl< silica gel column with 1 % methanol in methylene chloride containing drops of triethylamine yield 
51 0 mg (93%) the desired phosphoramidite. 
Rf=0,87 (dichloromethane/methanol, 99/1), 


EXAMPLE 35 


1-(4*(3-0-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6-nltrophenyl)-1-0-((2-cyanoethoxy)- 
drisopropylanilnopho&phlno)ethane 

A. 4-(3-Hydroxypropoxy)-3-methoxyacetophenone 

[0559] 3-Bromo-1 -propanol (53 ml, 33 mmol) was refluxed in 1 00 ml of anhydrous acetonitrile with 4-hycfroxy-3-meth- 
oxyacetophenone (5 g, 30 mmol), K2CO3 (6 g), and Kl (300 mg) overnight (15 h). 

Methylenechloride (150 ml) was added to the reaction mixture after cooling to room temperature. The mixture was 
filtered and the solid residue was washed with methylene chloride. The combined organic solution was evaporated to 
dryness and redissolved in 100 ml methylenechloride. The resulted solution was washed with saturated NaCI solution 
and dried over sodium sulfate. 6.5 g (96.4%) of desired product was obtained after removal of the solvent in vacuo 
Rp0.41 (dichloromethane/methanol, 96/5). 

UV (methanol), maximum: 304, 273, 227, 21 0 nm: minimum: 291 , 244. 21 4 nm. 

NMR (DMSO-de) 6 7.64 (d, 1 H), 7.46 (s, 1 H), 7.04 (d, 1 H), 458 (b, OH), 4.12 (t, 2H). 3.80 (s, 3H), 3.56 (t 2H) 2 54 
(s, 3H), l.aa (m, 2H). NMR (DMSO-de) « ^96.3, 152.5, 148.6. 129.7, 123.1, 111.5, 11 0 3 65 4 57 2 55 5 31 9 
26.3. ' ' ' ' 


B, 4-(3-Acetoxypropoxy)-3-methoxyacetophenone 

[0560] 4-(3-Hydroxypropoxy)-3-methoxyacetophenon6 (3.5 g. 15.6 mmol) was dried and dissolved in 80 ml anhy- 
drous acetonitrile. This mixture, 6 ml of trlethy?amine and 6 ml of acetic anhydride were added. After 4 h, 6 mf methanol 
was added and the solvent was removed in vacuo. The residue was dissolved in 100 mi dichloromethane and the 
solution was washed with dilute sodium bicarbonate solution, then water The organic phase was dried over sodium 
sulfate and the solvent was removed. The so/Id residue was applied to a silica gel column with methylene chloride to 
yield 4.1g of 4-(3-acetoxypropoxy)-3-methoxyacetophenone (98.6%). 
Rf = 0.22 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 303, 273, 227, 210 nm; minimum: 290. 243, 214 nm 

^^^^NMR^(DMSO-de) S 7,62 (d, 1H), 7.45 (s, 1H), 7.08 (d, 1H), 4.12 (m, 4H, 3.82 (s, 3H). 2.54 (s, 3H), 2.04 (m, 2H), 
13c NMR (DMSO-de) 5 196.3. 170.4, 152.2, 148.6, 130.0, 123.0, 111 .8, 110.4, 65.2, 60.8, 65.5. 27.9, 26.3, 20.7. 

C. 4-(3.Acetoxypropoxy)-3-methoxy-6-nltroacetophenone 

[0561] 4-(3^Acetoxypropoxy)-3-methoxyacetoph6none (3.99 g, 15 mmol) was added portionwlse to 15 ml of 70% 
HNO3 in water bath and l<eep the reaction temperature at the room temperature. The reaction mixture was stirred at 
room temperature for 30 min and 30 g of crushed ice was added. This mixture was extracted with 100 ml of dichlo- 
romethane and the organic phase was washed with saturated sodium bicarbonate solution. The solution was dried 
over sodjum sulfate and the solvent was removed in vacuo. The crude mixture was applied to a silica gel column with 
gradient methanol In methyiene chloride to yield 3.8 g (81 .5%) of desired product 4-(3-acetoxypropoxy)-3-methoxy- 
e-mtroacetophenone and 0,38 g (8%) of ipso-substituted product5-(3-acetoxypropoxy)^4-methoxy-1 ,2-dinitrobenzene 
Side ipso-substituted product 5-(3-acetoxypropoxy)-4-methQxy-1 ,2-dinitrobenzene: 
Rf=0.47 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 334, 330, 270, 240, 212 nm; minimum: 310 282 263 223 nm 

i^^^'^l^^r^'^^'^H^ ^"^^ ^-^"^ ^'^^ 2^)' ^-^^ ^^>' 4'02'(s, 3H), 2.20 (m, 2H), 2.08 (s. 3H). 

NMR (CDCI^) 6 170.9, 152.2, 151.1, 117.6. 111.2, 107.9, 107.1, 66.7, 60.6, 56.9, 26.2. 20.9 
Desired product 4-(3-acetoxyprDpoxy)-3-methoxy-e-nftroacetophenone: 
Rf=0.29 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 344, 300, 246, 213 nm; minimum: 320, 270, 227 nm. 
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1 H NMR (CDCI3) 6 7.62 (s, 1 H), 6.74 (s, 1 H), 4.28 (t, 2H), 4.20 (t, 2H), 3.96 (s, 3H), 2.48 (s, 3H). 2.20 (m. 2H). 2,06 (s. 3H). 
^^c NMR (CDCy 5 200.0, 171.0, 154.3, 148.8, 138.3, 133.0, 108.8, 108.0, 66.1, 60.8, 56.6, 30.4, 28,2, 20.9, 

D- 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nltrophenyl}ethanol 

[0562] 4-(3-Acetoxypropoxy)-3-methoxy-6-nitroacetophenone (3.73 g, 1 2 mmol) was added 1 50 ml ethanol and 6.5 
g of K2CO3. The mixture was stirred at room temperature for 4h andTLC with 5% methanol in dichloromethane indicated 
the completion of the reaction. To this same reaction mixture, it was added 3.5 g of NaBH4 and the mixture was stirred 
at room temperature for 2h. Acetone (1 0 mf) was added to react with the remaining NaBH4. The solvent was removed 
in vacuo and the residue was uptaken into 50 g of silica gel. The sriica gel mixture was applied on the top of a silica 
gel column with 5% methanol in methylene chloride to yield 3.15 g (97%) of desired product 1-{4-(3-hydroxypropoxy)" 
3-methoxy-6-nitrophenyl)ethano[, 

Intermediate product 4-{3-hydroxypropoxy)-3-methoxy-6-nitroacetophenone after deprotectior: 
Rf=0.60 (dichloromethane/methanol, 95/5). 

Final product 1 -(4-(3-hydroxypropoxy)-3-methoxy-6"nitrophenyi)etharof: 
R,-0.50 (dichloromethane/methanol, 96/5). 

UV (methanol), maximum: 344, 300, 243, 219 nm: minimum; 317. 264, 233 nm. 

1H NMR (DMSO-de) 5 7.54 (s, 1H), 7.36 (s, 1H), 5.47 (d, OH), 5.27 (m. 1H). 4.55 (t, OH). 4.05 (t, 2H), 3.90 (s, 3H), 
3.55 (q. 2H), 1 .88 (m, 2H), 1 .37 (d, 3H). 

^^C NMR {DMSO-d6)6 153.4, 146.4, 138.8. 137.9, 109.0, 108.1, 68.5. 65.9, 57.2, 56.0, 31.9, 29.6. 

E. 1-(4-(3-0-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6 -nitrophenyl)ethanol 

[0563] 1-(4-{3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol (0,325 g, 1.2 mmoi) was co-evaporated with an- 
hydrous pyridine twice and dissolved in 15 mt anhydrous pyridine. The solutfon was cooled in ice-water bath and 450 
mg (1.33 mmoi) of DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml 
methanol was added to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with 
toluene twice to remove trace of pyridine. The final residue was applied to a silica gei column with gradient methanol 
in methyfene chloride containing drops of triethylamine to yieJd 605 mg (88%) of desired product 1 -(4-{3-0-4,4'-dimeth- 
oxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol. R, = 0.50 (dichloromethane/methanol, 95/6). 
UV (methanol), maximum: 354, 302, 282, 274, 233, 209 nm; minimum: 322, 292, 263, 222 nm, 
1H NMR (DMSO-de) 5 7.54 (s, IN), 6.8-7.4 (ArH), 5.48 (d, OH), 5.27 (m, 1H), 4.16 (t, 2H), 3.85 (s, 3H), 3 72 (s 6H) 
3.15 (t, 2H). 1 .98 (t. 2H), 1.37 (d, 3H). 

130 NMR (DMSO-de) 5 157.8, 153.3, 146.1 , 144.9, 138.7, 137.8, 135.7, 129.4. 128.7, 127.5, 127.4. 126.3, 112.9, 112.6 
1 08.9, 1 08.2, 85.1 , 65.7, 63.7, 59.2, 55.8, 54. B, 29.0. 25.0. 

F. 1-(4-(3-0-4,4'-Drniethoxytritylpiiopoxy)-3-methaxy-6 -njtrophenyl)-1-0-((2-cyanoethoxy)- 
diisopropyIaniinophosphino)ethane 

[0564] 1 -(4-(3-0-4.4'^Dimethoxytrityipropoxy)-3-methoxy-6-nitrophenyl)6thanol (200 mg. 3.5 mmol) was dried under 
high vacuum and was dissolved in 15 ml of anhydrous methylene chloride. To this solution, it was added 0.5 ml N.N- 
dlisopropylethylamlne and 0.2 ml (0.89 mmoJ) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction 
mixture was stinted at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture 
was washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed 
in vacuo and a quick silica gel column with 1 % methanol in methyfene chloride containing drops of triethylamine yield 
247 mg (91.3%) the desired phosphoramidrte 1-(4-(3-0-4,4'-dimethoxytrityIpropoxy)-3-methoxy-6-nitrophenyl)-1-0-{ 
(2-cyanoethoxy)-diisopropylaminophosphfno)ethana. 
Rf=0.S7 (dichloromethane/methanol, 99/1). 

EXAMPLE 36 


OligonucJeotlde synthesis 

[0565] The oligonucleotide conjugates containing photocleavabis linker were prepared by solid phase nucleic acid 
synthesis (see: Sinhaet al. Tetrahedron Lett. 1983. 24. 5843-5846: Sinha etal. Nucleic Acids Res. 1964. 12. 4539-4557- 
Beaucage et al. Tetrahedron 1993, 49, 6123-6194; and Matteucci et al. J. Am. Chem. Soc. 1981, 103.3185-3191) 
under standand conditions. In addition a Jonger coupling time period was employed for the incorporation of photocleav- 
able unit and the 5' terminal amino group. The coupling efficiency was detected by measuring the absorbance of 
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released DMT cation and the results indicated a comparable coupling efficiency of phosphoramidite 1-(2-nitro-5-(3-0- 
4,4'-dimethoxytrltylpropoxy)plienyl)-1-0-((2-cyanoethoxy)-diisopropyfaminophosphino)ethane or 1-(4-{3-0-4,4'- 
dlmethoxytritytpropoxy)-3-methoxy-6-nitrophenyl)-1-0-((2-cyanoethoxy)'diisopropylaminophospliino)^^ with 
those of common nucleoside phosphoramodites. Deprotection of the base protection and release of the conjugates 
from the solid support was carried out with concentrated ammonium at 56 "C overnight. Deprotection of the base 
protection of otherconjugates was done by fast deprotection with AMA reagents. Purification of the MMT-on conjugates 
was done by HPLC (trityl-on) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonltrlle (5% to 25% 
in 20 minutes). The collected MMT or DMTprotected conjugate was reduced in volume, detrltylated with 80% aqueous 
acetic acid (40 mln, 0 'C), desalted, stored at '20*C. 

EXAMPLE 37 

Photolysis study 

[0566] In a typicaf case, 2 nmol of oligonucleotide conjugate containing photocleavable ljnl<er in 200 [i\ distilled water 
was irradiated with a fong wavelength UV famp (B)ak Ray XX-15 UV lamp, Ultraviolet products, San Gabriel, CA) at a 
distance of 10 cm (emission peal<365 nm, lannp intensity - 1 .1 mW/cm2 at a distance of 31 cm). The resulting mfxtu re 
was analyzed by HPLC (trityl-off) using 0. 1 M triethylammonium acetate, pH 7.0 and a gradient of acetonitriie. Analysis 
showed that the conjugate was cleaved from the Imder within minutes upon UV irradiation. 

Equivalents 

[0567] Those sl<illed in the art wfll recognize, or be able to ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the 
scope of this inventfon and are covered by the following cfaims. 
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SBQTTBUCB LXSTI»G 


GENERAjj-'^INPORMAT ION 

(i) APPLICANT: 

(A) NAME: 3EQUEN0M, INC. 

<B) STREET: 11555 Sorrento Valley Road 

(CJ CITY: San Diego 

<D) STATE: California 

(E) COUNTRY: USA 

(P) POSTAL CODE (ZIP): 92121 

(i) INA/ENTOR/APPLICAOT: 

.. (A) NAME: Hxjbert Koster 

(B) STREET: 836 Via Mallorca Drive 

(C) CITY; La Jolla 
<D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) ; 92 037 

(i) INVENTOR/APPLICANT: 

<A) NAME: Kai Tang 

(B) STREET: 8521 Summer dale Rd #241 

(C) CITY J San Diego 

(D) STATE: California 
(D) COUNTRY: USA 

<E) POSTAL CODE (ZIP): S2126 

(i) INVENTOR/ APPLICANT: 

(A) NAME: rong-Jing Fu 

(B) STREET: 10615 Dabney Dr. #21 

(C) CITY: San Diftgo 
{T» STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

( i ) INVENTOR/ APPLI CANT : 

(A) NAME: Carsten w. Siegert 

(B) STREET; Geilstr. 42 

(C) CITY: 22303 Hamburg 
<D) STATE: 

CD) COUNTRY: Germany 
(E) POSTAL CODE (2IP} t 

{i) INVENTOR/ APPLICANT: 

(A) NAME: Daniel P. Little 

(B) STREET: 3 93 Glendale Lake Rd. 
iC) CITY: Fatten 

(D) STATE : Pennsylvania 

(D) COUNTRY: USA 

(E) POSTAL CODE fZIP) : 18668 

(i) INVENTOR/ APPLI CANT: 

(A) NAME: G. Scott Higgins 

(B) STREET: Haselweg 1 

(C) CITY; 22680 Weidel 

(D) STATE: 

(D) COUNTRY: Germany 

<E) POSTAL CODE (ZIP) ; 
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(i) INVENTOR /APPLICANT: 

(A) NAME: Andreas Braun 

. .. (B) STREET: 13232 Benchley Road 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY; USA 

(E) POSTAL CODE (ZIP) : 02130 

(i) I^fV;ENTOR/ APPLICANT: 

(A) NAME: Brigitte Datnhof f er-Demar 

(B) STREET: 3899 Haines St. #8-308 

(C) CITY; San Diego 
{D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92109 

(i) INVENTOR/ APPLICANT : 

(A) NAME: Christian Jurinke 

{B) STREET; Grope Hall 68 

(C) CITY: 22115 Hamburg 

(D) STATE: 

(DJ COUNTRY: Germany 

(E) POSTAL CODE (ZIP) : 

( i ) INVENTOR/APPLICANT : 

(A) NAME: Dirk Van d«n Boom 

(B) STREET: Forsthausstr . 8 

(C) CITY: 633303 Preiech 
<D) STATE: 

(Dl COUNTRY : Germany 
(EJ POSTAL CODE (ZIP) : 

( i ) INVENTOR/APPLICANT : 

(A) NAME: Goubing Xiang 

(B) STREET; 11381 Zapata Ave. 

(C) CITY: San Diego 
(DJ STATE: California 
(DJ COUNTRY; USA 

(E) POSTAL CODE (ZIP) : 9212€ 


(ii) TITLE OP THE INVENTION: DKA DIAGNOSTICS BASED ON MASS SPECTROMETRY 

(iii) NUMBER OF SEQUENCES; 320 

(ivj CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE; Brown, Martin, Haller & McClain 

(B) STREET: 1660 Union Street 

(C) CITY: San Diego 
(D> STATE : CA 

(E) COUNTRY: USA 
(F> ZIP: 92101-2926 

(v) COMPUTER READABLE FORM; 
(A) MEDIUM TYPE: Diskette 
m) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: PastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
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(B) FILING DATE; 06 -NOV- 1997 

(C) CIASSIFICATION; 

Cvii) PRIOR APPLICATION DATA: 

(A) APPLICATION NOMBER : 

(B) FILING DATE: 10/06/97 

(vii) PRIOR APPLICATION DATA: 

<A} APPLICATION NUMBER: 08/933,792 
{B} FILING DATE: 09/19/97 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER; 08/787,539 

(B) FILING DATE: 01/23/97 

Cvii) PRIOR APPLICATION DATA: 

CA) APPLICATION NUMBER: 08/73^,988 
(B) FILING DATE: 01/23/97 

(vii) PRIOR APPLICATION DATA; 

<A> APPLICATION NUMBER: 08/746,055 
(B) FILING DATE; 11/06/96 

Cvii) PRIOR APPLICATION DATA: 

tA) APPLICATION NUMBER: 08/746,036 
(B) FILING DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744,5 90 

(B) FILING DATE; 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744,481 

(B) FILING DATE: 11/06/96 

(viii) ATTORNEY/AGENT INFORMATION i 

(A) NAME: Seidman, Stepiiani* L 

( B ) REGISTRATION NUMBER : 33,779 

(C) REFERENCE/DOCKET NUMBER: 7352-20 

(ix> TELECOMMUNICATION INFORMATION; 

(A) TELEPHONE: 619*238-0999 

(B) TELEFAX: 619-238-0062 
(CJ TELEX T 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS ^ 
(A) LENGTH; 20 bade pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

<ii) MOLECULE TYPE; cDNA 
(iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
(v) FRAGMENT TYPE; 
<vi} ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 


99 


EP 1 164 203 A2 


GCAAGTGAAT CCTGAGCGTG 

■ -^V^ (2) INFORMATION FOR SEQ ID NO : 2 : 

{i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unJoiown 

(ii) MOLECULE TYPE: cDKA 
<iii3 HYPOTHETICAL: NO 
(iv^ ANTISENSE: NO 
<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOORCE: 

(xi) SEQUENCE DESCRIPTION: SEQ XD NO; 2 
GTGTGAAGGG TTCATATGC 

t2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
(y) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
ATCTATATTC ATCATAGGAA ACACCACA 

{2> INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 30 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unlcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID UO : 4 : 

GTATCTATAT TCATCATAGG AAACACCATT 

(2] INFORMATION FOR SEQ ID IJO 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(BJ TYPE; nucleic acid 
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(C) STItAKDEDWESS ; single 

(D) TOPOLOGY; unknown 

""{±±) MOLECUI^ TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
tvi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTTON: SEQ ID NO : 5 
GCTTTGGGGC ATGGACATTG ACCCGTATAA 


(2) INFORMATION FOR SEQ ID NO: 6: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 30 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: imknovm 

Cii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:€; 
CTGACTACTA ATTCCCTGGA TGCT3GGTCT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi] ORIGINAL SOURCE: 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO ; 7 : 
TTGCCTGAGT GCAGTATGGT 

( 2 ) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
fA) LENGTH; 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: imknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 
Civ) ANTISENSE: NO 
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<v) FRAGMENT TYPE; 
_ (vi) ORIGINAL SOTOCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 8 
AGCTCTATAT CGGGAAGCCT 


(2) INFORMATION FOR SEQ ID NOr9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; cDNA 
{iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPEi 

(vi) ORIGINAL SOURCE; 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 ; 
TTGTGCCACG CGGTTGGGAA TGTA 


(2) INFORMATION FOR SEQ ID NOrlO: 

Ci) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 26 base pairs 
(B) TYPE; nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; HO 

(iv) ANTISENSE: HO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AGCAACGACT GTTTGCCCGC CAGTTG 


(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: Single 
(D) TOPOLOGY; Unknown 

(ii) MOLECULE TYPE: CDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; HO 
(v> FRAGMENT TYPE: 
tvi) ORIGINAL SOURCE: 

(XL) SEQUENCE DESCRIPTION; SEQ ID NOill; 
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TACATTCCCA ACCGCGTGGC ACAAC 


(2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERXSTICS : 

(A) I^ENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
iC) STRANPSDNESS : single 
<D} TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(ivJ AHTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
AACToaCGGG CAAACAGTCG TTGCT 

(2) INFORMATION FOR SEQ ID WO; 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: unicnOTO 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(Vi> ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0;13; 
GCAAGTGAAT CCTGAGCGTG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnowti 

(ii) MOLECULE TYPE*. cDNA 
(iii J HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 

(Vj FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID rrO:l4: 
GTGTGAAGGG CGTG 

(2) INFORMATION FOR SEQ ID NO : L b 

(i) SEQUENCE CHARACTERISTICS: 
(Al LENGTH: 24 base pairs 
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(B) TYPE: nuclfeic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 
(iv> ANTISENSE: NO 

(V) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CTATATTCAT CATAGGAAAC ACCA 

(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
<D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ie; 
GTCACCCTCG ACCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTEJETICAL : NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID WO: 17: 
TTGTAAAACG ACGGCCAGT 

(2) INFORMATION FOR SEQ ID NO^IS: 

Ci3 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ifl base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: 

(vi) ORIGIKAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION; SEQ ID HO: 18 
CTTCCACCGC GATGTT<ai 

{2} INFORMATION FOR SEQ ID K0:i9i 

(i) SEQUENCE CHARACTERISTICS: 

(A) 1-BNOTH: 17 i:>aae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unJoiown 

(ii) MOI^CULE TYPE: cDNA 
fiii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 19: 
CA6GAAACAG CTATGAC 

<2) INFORMATION FOR SEQ ID NO: 20: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNBSS ; single 
(D) TOPOLOGY: \inknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL; NO 
(iv) ANTISENSE: HO 
(V) FRACaiENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 20: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO; 21; 

Ci) SEQUENCE C3iARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: xinknown 

Ui) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<Xi) SEQUENCE DESCRIPTION; SEQ ID SO 
GTCACCCTCG ACCTGCAGC 
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{2) INFORMATION FOR SEQ ID NO:22: 

■^■^^' (i) SEQUENCE CHARACTERISTICS r 

(A) LBKGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS ; single 
CD) TOPOLOGY: unknown 

fii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO:22 
GTTGTAAAAC GAGGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAKDEDNES S : s ingl e 

(D) TOPOLOGY: unknown 

fii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
{v> FRAGMENT TYPE: 
(vi) ORIGINAL SOXJRCE: 

ixij SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 

(2) INFORMATION FOR SEQ ID NO: 24: 

U) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

(ii> MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO:24: 
TGTACGTCAC AACT 

(2) INFORMATION FOR SEQ ID NO; 25; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 7S base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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iii) MOLECULE TYPE: cDNA 

fiii) HYPOTHETICAL: NO 
..-.iiv) ANTISENSE: NO 
' ' (v) FRAGMENT TYPE; 

(Vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:25: 

AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 
CCGACGCATC AGATCTGG 

(2) INFORMATION FOR BEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 ba«e pairfi 
(BJ TYPE: nucleic acid 

(C) STRANDEDNCSS : single 

(D) TOPOLOGY: un3cnovm 

iii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAC*IEKT TYPE; 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26; 
CTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 22 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOt27: 
GATGATCCGA CGCATCACAG CTC 

(2) INFORMATION FOR SEQ ID NO: 28; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETlCyiL : NO 
(ivj ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ -ID NOr28 

TC:rCv"CCAA GAGCTGTGRT GCGTCGGRTC ATC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(ij SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B> TYPEi nucleic acid 
CO STRANI3EDNESS i single 
CD) TOPOLOGY; xinknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTKETICAL : NO 
" (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29; 
GATGATCCGA CGCATCACAG CTC 

{2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: IS bas« pairs 
(B) TYPE; nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: untoown 

(ii) MOLECULZ TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
liv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE^ 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO; 30: 
GTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO; 31; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEN(m3: 15 base pairs 

{B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

fr>) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
iv) FRAGMENT TYPE; 
(v-i) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 3 1 : 
TCGGTTCCAA GAGCT 

(2) INFORMATION FOR SEQ ID NO ^ 3 1 : 
fi) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
r'- '^l^ (C) STRANDEDNESS : single 

(D) TOPOLOGY: imknown 

(iij MOLECUXiE TYPE; CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
TCGGTTCCAA GAGCT 

i2) INFORMATION FOR SEQ ID NO; 32; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE I NO 
<v) FRAGMENT TYPE: 
fvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0;32; 

CATTTGCTTC TGACACAACT G 

(2) INFORMATION FOR SEQ ID KO;33: 

(i> SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECDt#E TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTCTCTGTC TCCACATGC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) AMTISENSE: NO 

(v) FRAGMENT TYPE: 
-Xvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
TGCACCTGAC TC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 base pairs 

(B) TYPE; nucleic acid 
CCJ STRAKDEDNESS ; single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AKTISENSE: NO 
(V) FRAGMENT TYPE: 
(viJ ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3S: 
TGCTTACTTA ACCCAGTQTO 


(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL ; NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE; 
fvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3€: 
CACACTATGT AATACTATGC 

<2) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 
(iv> ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ HO jl: 
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GAAAATATCT OACAAACTCA TC 2 2 

(2) INFORMATION FOR SEQ ID NO;38; 

^ {i> SEQUENCE CHARACTERISTICS: 

(A> LEKGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRAHDEDITESS : single 

(D) TOPOLOGY: unicnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
tiv) ANTlSENSEi MO 

(v) FRAGMENT TYPE? 

( vi ) ORIG INAL SOURCE : 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CATGGACACC AAATTAAGTT C 21 

(2) INFORMATION FOR SEQ ID NO: 39: 

20 

{±) SEQUENCE CHARACTERISTICS: 

(A) LEKGTH: 14 base pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

35 

Cii) MOLECULE TYPE: cDNA 
<iiij HYPOTHETICAL: NO 
(iv} ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

30 

iKX) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TGAGACTCTG TCTC 14 
(2 J INFORMATION FOR SEQ ID NO: 40: 

35 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 « base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS; single 
(D) TOP0UX3Y: unknown 

40 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
Cv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

45 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 40: 
TTCCCCAAAT CCCTG 15 
(2) INFORMATION FOR SEQ ID NO: 41: 

50 

ii) SEQtJENCE CHARACTERISTICS; 
I A) LENGTH: IS base pairs 


55 
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(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 
■c:.^ <D) TOPOIiOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(tii) HYPOTHETICAL ] NO 
(iv) ANTISEKSE: NO 

(V) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOr41 
GGCACGGCTG TCCAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

CO STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSEi NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 42 
AGGCCGCGCT CGGCGCCCTC 

(2) INFORMATION FOR SEQ ID NO; 43: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 18 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY I unknown 

(ii) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTIONi SEQ ID NO:4 3 
GCGGACATGG AGGACGTG 

(2) INFORMATION FOR SEQ ID NO: 44; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: 
Wi) ORIGINAL SOURCE; 

'■''^(xi) SEQUENCE DESCRIPTION; SSQ ID NO; 44: 

5 

GATGCCGATG ACCTGCAGAA G 21 
(2) INFORMATION FOR SEQ ID NO; 45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : Single 
<D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: cDNA 
^5 Ciii) HYPOTHETICAL: NO 

{ iv ) AKTI SENSE : NO 
<v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;45; 

20 

CCCTTACCCT TACCCTTACC CTAA 2 4 

(2) INFORMATION FOR SEQ ID NO: 46: 

ti) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sin&'le 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
30 (iii) HYPOTHETICAL: NO 

tiv) ANTISENSE: NO 
fv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0;46: 

35 

AATCCGTGCA GCAGAGTT 18 
(2) INFORMATION FOR SEQ ID NO: 47; 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 base pairs 

(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
45 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
fv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ II? NO:47i 

50 

TGTCAGAGCT GGACAAGTGT 20 


55 
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(2) INFORMATION FOR SEQ ID NO :4a; 

-^ii) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 0 base pairs 

(B) TYPE; nucleic acid 
{C) STRANDE0NES5 : single 
(D) TOPOLOGY: UJfXtaOWn 

(ii) MOLECULE TYPE: cDNA 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

-- (Xi) SEQXJENCE DESCRIPTION: SEQ ID KO:4a: 

15 

GATATTGTCT TCCCGGTAGC 20 
(2> INFORMATION FOR SEQ ID NO; 49: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A> LENGTH: 20 base pairs 

(B) TYPE; nucleic acid 
{€) STRANDEDNESS : single 
(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 
25 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

30 

CTCGGACCAG GTGTACCQCC 20 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQXJENCE CHARACTERISTICS: 
35 (A) LENGTH; 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY I \anJcnown 

<ii) MOLECULE TYPE; CDNA 
40 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION i SEQ ID NO: 50: 

45 

CCTGTACTGG AAGGCGATCT C 21 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
50 <A) LENGTH; 20 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJmown 
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Cii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
5 "^-.V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 51: 
CATGAGGCAG AGCATACGCA 20 

10 

(2) INFORMATION FOR SEQ ID WO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 
fB} TYPE; nucleic acid 

15 " tC) STRANDEDNESS; single 

{D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
Civ) AHTISENSE: HO 

20 (V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO; 52; 

GACAGCAGCA CCGAGACGAT 20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 basft pairs 

(B) TYPE; nucleic acid 
30 <C) STRANDEDNESS : single 

(D) TOPOLOGY; unJcnown 

Cii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSEt NO 
35 <V> FRAOffiNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53; 
CGGCTGCGAT CACCGTGCGG 20 
(2) INFORMATION FOR SEQ ID NO; 54; 


40 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: CDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
{v) FRAGMENT TYPE: 
Evi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID 


50 
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GATCCACTGT GCGACGAGC 19 
(2) INFORMATION FOR SEQ ID NO:55: 

5 ->^\ 

(I) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: base pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS : single 
( D ) TOPOLOGY : unknown 

(II) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0;5S: 
GCGGCTGCGA TCACCGTGC 19 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: base pairs 

CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iiij HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

<v) FRAGMENT TY:>Ei 
{vi) ORIGINAL SOURCE: 


30 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S6i 


TGCACCTGAC TC 


12 


f2) INFORMATION FOR SEQ ID K0:57; 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
(0} STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NOtS?; 
CTGTGGTCGT GC 12 
(2) INFORMATION FOR SEQ ID NO :5a: 

^° (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
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CD) TOPOLOGY: unknown 

.....-^^i) MOLECULE TYPE; cDNA 
'(iii) HYPOTHETICAL: NO 
^ (iv) ANTISENSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S8: 

GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGCGC 39 

C2) INFORMATION FOR SEQ ID NO: 59: 

-'(i) SEQUENCE CHARACTERISTICS; 
(AJ LENGTH: 21 base pairs 
(B} TYPE; nucleic acid 

(C) STRAHDEDNESS : single 

(D) TOPOLOGY: unknown 

{ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

TCTCTGTCTC CACATGCCCA Q 21 

(2) INFORMATION FOR SEQ II? NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: base pairs 
^° <a) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unknown 

<ii> MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
^5 (iv) ANTXSEHSE: NO 

(V) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 
C €1 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 
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Cvi) ORIGINAL SOURCE: 

SEQUENCE DESCRIPTION: SEQ ID NO: 61 
AACTAAGCCA TGTGCACAAC A 

{ 2 ) INFORMATION FOR SEQ ID NO : 62 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bajse pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 
( D > TOPOLOGY : tmknown 

"Tii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSEt no 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 62 
UCCGGUCUGA UGAGUCCGUG AGQAC 

{2) INFORMATION FOR SEQ ID NO:fi3: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANBEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; RNA 
{iii) HYPOTHETICAL: NO 
<lv) ANTISENSE: NO 

(vj FRAGMENT TYPE; 
Cvi> ORIGINAL SOURCE? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: €3 

GUCACUACAG GUGAGCUCCA 

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENX3TH: 20 base pairs 
iB) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(xi) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

<xi) SEQtffiNCE DESCRIPTION: SEQ ID NO : 64 
CCAUGCGAGA GUAAGUAGUA 

(2) INFORMATION FOR SEQ ID NO: 65: 
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{i) SEQUENCE CKARACTERISTICS : 
(A) LENGTH: 50 base pairs 
> . (B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 
{iii) HYPOTHETICAL: NO 
(iv) ANTISEHSE: HO 
iO FRAGMENT TYPE: 

(Vi) ORIGINAL SOURCS: 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO:6S: 
AGGCCUGCGG CAAGACGGAA AGACCAUGGXT CCCONAUCUG CCGCAGGAUC SO 
(2) INFORMATION FOR SEQ ID NO: 66: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: ^m3cnown 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

25 <v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CATTTGCTTC TGACACAACT 20 

30 

<2) INFORMATION FOR SEQ ID NO; 67: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEtXNESS ; single 

(D) TOPOLOGY: unkncwn 

{ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
{vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:67: 
TCTCTGTCTC CACATGCCCA G " 21 

(2) INFORMATION FOR SEQ ID NOrfiS; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D} TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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10 


(iii) HYPOTHETICAL: NO 
liv) ANTISENSE: NO 
FRAGMENT TYPE: 
{vi) OHIGIMAL SOURCE; 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO;68: 
GTCGTCCCAT GGTGCACCTG ACTC 24 
{2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS; 
(A) XiENGTH: 22 base pairs 
<B) TYPE: nucleic acid 

- CO STR/^EDNESS: single 
(D) TOP03XX3Y: unknowl 

15 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
{iv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

20 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CGCTGTGGTG AGGCCCTGGG CA 22 
C2] INFORMATION FOR SEQ ID NO: 70: 

25 

(i) SEQUENCE CHARACTfiRTSTlCS ; 

(A) LENGTH: 24 ba^e pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lonknown 

30 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GACGACGACT GCTACCT6AC TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 71: 


40 


45 


50 


55 


ii) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID U-' 
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ACAGCGGACT GCTACCTCAC TCCA 24 
. ._(2) lifFORMATION FOR SEQ ID NO : 72 : 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

' {C) STRANDEDHE5S : single 

(D) TOt^OLOGY; un)cnown 

Ui) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTISEMSE: NO 

<v) FRAC»1ENT TYPE: 
-(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID W0:72: 

TGGAGTCAGG TAGCAGTC IS 

(2) IKFORKATIOM FOR SEQ ID NO : 73 ; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 19 base paira 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: vuiknown 

(ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Ui) SEQXJENCE DESCRIPTION: SEQ ID KO:73: 
CAQCTCTCAT TTTCCATAC X9 
(2) INFORMATION FOR SEQ ID NO; 74: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: CDNA 
tiii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
Cv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 74 ; 
AGCCCCAAGA TGACTATC IS 
(2) INFORMATION FOR SEQ ID NO: 75: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

5 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AMTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGAGGAGCTC AAGGCCAGAA T 21 
(2) INFORMATION FOR SEQ ID NO: 76: 

1^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

2^ (Xi) SEOUENCB DESCRIPTION: SEQ ID NO: 75: 

CAGGGGCAGC TCAGCTCTC 19 
(2) INFORMATION FOR SEQ ID NO: 77; 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH*. 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL s NO 
<iv) AMTISENSE: NO 

(v) FRACMENT TYPE; 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

40 

GGCACGGCTG TCCAAGGA 18 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
<B) TYPE: nucleic acid 

(CJ STRANDEDNESS; single 
(D) TOPOLOGY: vinknown 

(ii) MOLECULE TYPE: CDNA 
^° (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 


55 
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<vi) ORIGINMi SOURCE: 
. SEQUENCE DESCRIPTION: SEQ ID KO:78 

5 

AGGCCGCGCT CGGCGCCCTC 20 

(2) INFORMATION FOR SEQ ID NO: 79: 

10 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 18 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: HO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

20 <xi| SEQUENCE DESCRIPTION; SEQ ID NO: 79; 

CTTACTTGAA TTCCAAQAGC 20 

(2) INFORMATION FOR SEQ ID NO: BO; 

2s Ci) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 22 base pairs 
<B) TYPE; nucleic acid 
to STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGGCTGACTT GCATGGACCG OA 22 

(2) INFORMATION FOR SEQ ID NO; 51 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
iC) STRAKDEDIilESS : single 
iB) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
AGCCAGGACA AG 12 
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(2) INFORMATION FOR SEQ ID NO: 82: ■ 

f^X. SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 15 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS I single 
<D) TOPOLOGY ^ unknown 

(ii) MOLECULE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

SEQUENCE DESCRIPTION: SEO ID NO: 82: 

^5 ACAGCAGGAA CAGCA IS 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 18 base pairs 

^0 {Bl TYPE: nucleic acid 

{C} STRANBEDNESS : single 
<D) TOPOLOGY: untoown 

<ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 
25 (ivi ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOUriCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

30 GCGGACATGG AGQACGTG 18 

(2) INFORMATION FOR SEQ ID NO 164: 

{i) SEQUENCE CHARACTERISTICS ; 
(A) LENGTH: 21 base pairs 
35 tB) TYPE: nucleic acid 

CO STRANDEDNESS : single 

(D) TOPOLOGY; unJtnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
40 {iv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO -.64: 

45 GATGCCGATG ACCTGCAGAA G 21 

(2) INFORMATION FOR SEQ ID NO: 85; 

(i) SEQUENCE CHARACTERISTICS; 
(AJ LENGTH; 24 base pairs 
50 (B) TYPE : nucleic acid 

to STRANDEDNESS: single 
(D) TOPOLOGY: unJcnown 
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(ii) MOLECUIiE TYPE: CDNA 
tiii) HYPOTHETXCAL; NO 
. ANTISBNSE: NO 

(V) FRAGMENT TYPE: 
5 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GTGCCCTGCA GCTTCACTGA AGAC 24 

10 

(2) INFORMATION FOR SEQ ID NO; 86: 

(i) SEQUENCE CHARACTERISTICS; 
- - (A) LENGTH; 12 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
20 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOTJRCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 86: 
AGCCAQGACA AG 12 

25 

(2) INFORMATION FOR SEQ ID NO: 87 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 -base pairs 

(B) TYPE: nucleic acid 
30 <C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 
35 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION ^ SEQ ID NO: 87: 
AGCCAGGACA AGTC 14 

40 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs 

45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: tjnknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 
50 (iv) ANTISENSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 
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20 


Ui) SEQUENCE DESCRIPTION: SEQ ID NO: SB: 
AGCCACC•^:'^ AGA 13 

5 

(2) INFORMATION FOR SEQ ID NO: 89; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 

--5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
(d) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDKA 
(i-ii) HYPOTHETICAL I NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION.: SEQ ID NO: 89: 
ACAGCACCAA CAGCA 15 

(2) INFORMATION FOR SEQ ID NO ; 90 : 

Ci) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEC»IESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
^° (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: SO: 
ACAGCAOGAA CAGCATC 17 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY; unJcnown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AimSENSE: NO 
iv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 91 : 

SO 

ACAGCAGGAA CAGCAG 16 
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(2} INFORMATION FOR SEQ ID NO ; 92 : 

/j-V SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL; NO 

(iv) ANTISENSEj NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE; 

C-xi) SEQUENCE DESCRIPTION: SEQ ID NOr92: 
GCGGACATGG AGGACGTG 


{2) INFORMATION FOR SEQ ID NO; 93: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSEr NO 

(v) FRAOtlSNT TYPE: 

(vi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GCGGACATGG AGQACGTGGC 

(2) INFORMATION FOR SEQ ID NO; 94: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AHTISENSE: NO 
{V> FRA01ENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 94: 
GCGGACATGG AGGACGTGC 

(2) INFORMATION FOR SEQ ID NO: 95: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
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(D) TOPOLOGY: unknown 

^7.v) MOLECULE TYPE: cDNA 
5 (iii> HYPOTHETICAL: NO 

(iv) AKTISENSE: NO 

(v) FRAGMENT TYPE: 
(v±) ORIGINAL SOURCE: 

(Xi) SEOUENCE DESCRIPTION: SEQ ID NO:55; 
GATGCCGATG ACCTGCAGAA G 21 

(2) INFORMATION FOR SEQ ID NOrSS: 

t5 (i) SEQUENCE CHARACTERISTICS i 

<A) LENGTH: 22 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOCSY: mlknown 

2« (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
(v> FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:S6: 

GATGCCGATG ACCTGCAGAA GC 22 

<2) INFORMATION FOR SEQ ID NO: 97; 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 23 base pairs 

<B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

_ (ii) MOLECXJLE TYPE: cDNA 

{iii) HYPOTHETICAL: NO 

(iv) AHTISSNSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(5Ci) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GATGCCGATG ACCTGCAGAA GTG 23 
(2 J INFORMATION FOR SEQ ID NO; 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: HO 
(v> FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

^\ .L) SEQUENCE DESCRIPTION: SEO ID HO: 98; 

5 

GTGCCCTGCA GCTTCACTQA AGAC 24 

(2) INFORMATION FOR SEQ ID NO: 99: 

(1) SEQUENCE CHARACTERISTICS: 
to {A) LENGTH: 26 base paire 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(li) MOLECULE TYPE: cDNA 
15 (iii) HYPOTHETICAL: NO 

(iv) ANTISEUSE: NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 99: 

20 

GTGCCCTGCA GCTTCACTQA AGACTQ 26 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 
2g (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii> MOLECULE TYPE: CDNA 
30 (lii) HYPOTHETICAL: NO 

(Iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GTGCCCTGCA GCTTCACTGA AGACC 25 

(2) INFORMATION FOR SEQ ID NO 1 101: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH J 19 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS 1 single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 
(iiij HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO ^ 101: 
TATCTGTTCA CTTGTGCCC 19 
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{2) INFORMATION FOR SEQ ID NO ; 102 ; 

-r- 'i) SEQUENCE CHARACTERISTICS t 
(A) LENGTH: 19 baae pairs 
CB) TYPE: nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL; NO 

(iv) ANTISENSB: HO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

--{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

CAOAOaCCTG GGQACCCTG 19 


(2) INFORMATION FOR SEQ ID NO; 103; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
{B) TYPE: nucleic acid 
CO STRAKDEDKESS : single 
(D) TOPOLOGY: unknown 

Ui) MOLECULE TYPE; cDNA 
{iii) HYPOTHETICAL: NO 
(ivj ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQXJENCE DESCRIPTION;. SEQ ID NO; 103: 


30 


AC6ACAGGGC TGGTTGCC 


IB 


(2) INFORMATION FOR SEQ XD NO : 104 : 


35 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
tC> STRANDEDHESS : Single 
(D) TOPOLOGY: unknown 


40 


(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 


45 


ACTGACAACC ACCCTTAAC 


19 


(2) INFORMATION FOR SEQ ID NO:10^. 


50 


(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
{B) TYPE; nucleic acid 
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(C) STRANDEDNESS: single 
iD) TOPOLOGY; unknown 

MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) 7VNT1SENSE: WO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO; 105: 
CTGCTTGCCA CAGGTCTC 18 


(2) INFORMATION FOR SEQ ID NO: 106; 

(i) SEQOEWCB CHARACTERISTICS; 

(A) LENGTH; 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
{D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTI SENSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:106: 
CACAGCAGGC CAGTCTGC 18 


(2) INFORMATION FOR SEQ ID NO: 107: 

30 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 19 base pairs 
CB) TYPE: nucleic acid 

iC) STRANDBDNESS: Single 
(D> TOPOLOGY; unknown 

35 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANTISBNSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GGACCTGATT TCCTTACTG 19 


<2) INFORMATION FOR SEQ ID NO: 108; 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii> HYPOTHETICAL: NO 
(iv) ANTISENSE^ NO 
, .|yV, FRAGMENT TYPE: 
^ (vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : LOS : 

TGAATCTGAG QCATAACTG 19 

^° (2} INFORMATION FOR SEQ ID NO: 109; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH J 50 base pairs 
--(B) TYPE: nucleic acid 
. (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

{ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
{iv) AKTISENSE: NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X09: 
TTGCGTACAC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT SO 

25 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 27 base pairs 
{B) TYPE: nucleic acid 

^° (C) STRANDEDNESS: Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
^5 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
GTAAAACQAC GGCCAGTGTG TACGCAA 27 

40 

(2) INFORMATION FOR SEQ ID NO; 111; 

<i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 2 7 base pairs 
CB) TYPE J nucleic acid 
iC) STRANDEDNESS: single 
(D) TOPOLOGY t unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TACTGRAaqG CGATCTCAGC AATCAGC 2 7 

^ (2) INFORMATION PGR SEQ ID NO ; 112 : 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS I Single 
^° (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
<iii) HYPOTHETICAL; NO 

-Uv) ANTISENSE; NO 
(v) FRAGMENT TYPE: 
^5 (Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

GGCACGGCTG TCCAAGOAG X9 

{IBB) INFORMATION FOR SEQ ID NO:113: 

(i> SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 
55 (D> TOPOLOGY: imknown 

<ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

30 (Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113; 
AGGCCGCGCT CGGCGCCCTC 20 

35 

(2) INFORMATION FOR SEQ ID NO; 114: 

ti) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(iij MOLECULE TYPE; RNA 
Ciii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
45 (v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 114: 
GUCACUACAG GUGAQCUCCA 2 0 

50 

{2) INFORMATION FOR SEQ ID NO: 115: 
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<i) SEQUENCE CHARACTERISTICS; 

(AJ LENGTH: 21 bade paiirs 
- :d) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDWA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
iO (v) FRAGMENT TYPE: 

(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GAATTCGAGC TCGGTACCCG G 21 

15 

(2) INFORMATION FOR SEQ ID NO: 116: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
20 CC) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
25 iv) FRA01ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 116: 
CCGGQTACCG AGCTCGAATT C 21 

30 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: unJtnown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

Cv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO; 117: 
CCTCTTGGGA ACTGTGTAGT ATT 23 
''^ (2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE; nucleic acid 

„ (C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDHA 
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Ciii) HYPOTHETICAL: MO 
Civ) ANTI SENSE: NO 
fv.) FRAGMENT TYPE: 
5 "tvi) ORIGINAL SOURCE: 

<xi) SEQOENCE DESCRIPTION; SEQ ID N0:11B: 

AGQCTGTCTC TCTCCCTCTC TCATACACAC ACACACACAC ACACACACAC ACACACACAC 60 
ACACACACAC TCACACTCAC CCACANNNAA ATACTACACA GTTCCCAAGA GG 112 

10 

<2) INFORMATION FOR SEQ ID NO: 119; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 base pairs 

-- (B) TYPE: nucleic acid 
15 (C) STRAKDEDHESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; CDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISEMSE: NO 

20 (v) FRAGMENT TYPE: 

{vi) ORIGINAL SOURCE: 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATAC 49 

25 

(2) INFORMATION FOR SEQ ID NO; 12 0: 

(i) SEQUENCE CHftRACTERISTICS ; 

(A) LENGTH; 135 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(ivj ANTISENSE: NO 

55 (v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTIONr SEQ ID NO: 120: 

TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATACA CACACACACA 6 0 
CACACACACA CACACACACA CACACACACA CACTCACACT CACCCACANN NAAATACTAC 12 0 
ACAGTTCCCA AGAGG 135 

(2) INFORMATION FOR SEQ ID NO; 12 1 i 

(i) SEQUENCE CHARACrrSRlSTICS : 
45 (A) LENGTH: 12 base pairs 

(B) TYPE; nucleic acid 
{ C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE i NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 
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[xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
AATACT?-C'' Z" iW3 12 

5 

(2) INFORMATION FOR SEQ ID NO: 122; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS J unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cCNA 
Ciii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

(v> FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION^ SEQ ID NO:122: 
CTGATGCGTC GGATCATCTT TTTT 24 

20 

(2) INFORMATION FOR SEQ ID NO^I23; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 23 base pairs 

(B) TYPE: nucleic acid 
^5 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown. 

(ii) MOLECULE TYPE: cDMA 
Ciii) HYPOTHETICAL; NO 
(iv) AKTISENSE: NO 

30 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 123; 
GATGATCCGA CGCATCAGAA TGT 23 

3S 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 
(B) TYPE; nucleic acid 

40 iC) STRANDEDNESS; unknown 

<DJ TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(iv) ANTISENSEi NO 

45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xij SEQUENCE DESCRIPTION; SEQ ID NO: 124: 
GATCTAGCTG GGCCGAGCTA GGCCGTTGA 2 9 

50 

(2) INFORMATION FOR SEQ ID NO: 12 5: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 27 base pairs 

CB) TYPE; nucleic acid 

. r. (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AHTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 5 

CTGATGCGTC GGATCATCTT TT Tin ^' XT 

(2) INFORMATION FOR SEQ ID NOrl26: 

(1) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 12 base pairs 
<B) TYPE: nucleic- acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: untaown 

Cii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 
(vt) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 126: 
GATGATCCGA CG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i> SEQUENCE CHARACTERISTICS: 
(a; LENGTH: 15 base pairs 
fB) TYPE: nucleic acid 
to STRANDEDNESS: Single 
(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

fxi) SEQXJENCE DESCRIPTION: SEQ ID NO: 127: 
GATGATCCGA CGCAT 

(2) INFORMATION FOR SEQ ID NO: 12 3; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
{C3 STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 
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<iv) ANTISENSE: NO 
iv) FRAGMENT TYPE^ 
i:-L-) ORIGINAL SOURCE: 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO :12a: 
AAAAAAGATG AT 12 
(2) INFORMATION FOR SEQ ID NO: 123: 

10 

U) SEQUENCE CHARACTERISTICS: 

(A) I^NGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS ; dingle 
-(D) TOPOLOGY: unknown 

(ii) MOI^ECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
{vi) ORIGINAL SOURCE: 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 
GATCCGACGC AT X2 
12) INFORMATION FOR SEQ ID NO: 13 0: 

25 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 253 base pairs 
;b) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY; un)cnown 

30 

i±i) MOLECULE TYPE: CDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

35 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 130: 

GGCACGGCTG TCCAAGQAGC TGCAGGCGGC GCAGGCCCGG CTGGGCGCGG ACATGGAGGA 00 
CGTGTGCGGC CGCCTGGTGC AGTACCGCGG CQAGGTGCAG GCCATGCTCG GCCAGAGCAC 120 
CGAGGAGCTG CGGGTOCOCC TCGCCTCCCA CCTGCGCAAG CTGCGTAAGC QGCTCCTCCG 18 0 
40 CGATGCCGAT GACCTGCAGA AGTGCCTGGC AGTGTACCAG GCCGGGGCCC GCGAGGGCGC 240 

CGAGCGCGGC CTC 253 

(2) INFORMATION FOR SEQ ID NO;i3lr 

U) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTHi 58 base pairs 

OJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

55 
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{Xi} SEQUENCE DESCRIPTION: SEQ ID MO: 131: 
GAATTAra.^i'T. CCCAACCGCG TGGCACAACA ACTGGCGGGC AAACAGTCGT TGCTGATT 58 

5 

C2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 57 base pairs 
(BJ TYPE: nucleic acid 

10 {C) STRANDEDNESS : single 

(D) TOPOLOGY: Xinknown 

{ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL i NO 
(-i-v) ANTISEKTSB: NO 
15 (V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
ACCATTAAAG AAAATATCAT CTTTGGTGTT TCCTATGATG AATATAGAAG CGTCATC 57 

20 

<2) INFORMATION FOR SEQ ID NO: 133: 

{i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 

(B) TYPE; nucleic acid 
25 (C> STRANDEDNESS: single 

(D) TOPOLOGY: unJoiown 

(ii> MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL; NO 
{iv) ANTISENSE: NO 
30 fv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:133: 
CTATATTCAT CATAGGAAAC ACCAAAGAT 29 
<2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
<AJ LENGTH: 26 base pairs 
(B) TYPE: nucleic acid 

4Q (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CTATATTCAT CATAGGAAAC ACCAAT 26 

50 

(2) INFORMATION FOR SEQ ID NO: 135: 
(i> SEQUENCE CHARACTERISTICS: 
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{A) LENGTH: 29 baae pairs 

(B) TYPE: nucl«ic acid 

. . . -. 'O STRAITOEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECUI^ TYPE: cDWA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEMSE: NO 
tv) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13S 
CTATATTCAT CATAGGAAAC ACCAAAGAT 

(2) INFORMATION FOR SEQ ID NO: 136: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
to 5TRANDEDNHSS: single 
(D> TOPOLOGY: unknown 

{ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
Civ) ANTXSENSE: NO 
{v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 136 
CTATATTCAT CATAGGAAAC ACCAAAGATG ATATTTTC 

{2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: unknown 

(ii) MOUECULE TYPE: cDNA 
fiii) HYPOTHETICAL: NO 
(iv) AKTISENSE: NO 

(vj FRAGMENT TYPE: 
{vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 
CTATATTCAT CATAGGAAAC ACCAATG ATATTTTC 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO. 
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(iv) AKTISENSEr NO 

(v) FRAGH^IEMT TYPE: 

(vi) ORIGIKAIi SOURCE: 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:a30: 

CTATATTCAT CATAGC3AAAC ACCAAAGATA TTTTC 35 

(2) INFOiiMATION FOR SEQ ID NO : 13 9: 

^ ° { i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 31 base pairs 
(BJ TYPE J nucleic acid 
(C) STRANDEDNE5S : single 
_ (D) TOPOLOGY t unknown 

(ii> MOLECm^ TYPE: cDMA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISESSE: NO 
(V) FRAGMEST TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 
CTATATTCAT CATAGGAAAC ACCAAAGATG C 31 

(2) INFORMATION FOR SEQ ID NO: 14 0: 

25 

{i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH! 200 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unJcnown 

30 

(ii> MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

CTTCCACCGC GATGTTGATG ATTATQTGTC TGAATTTGAT GGGGGCAGGC GGCCCCCGTC 

TOTTTGTCGC GGGTCTGGTG TTGATQGTGQ TTTCCTGCCT TGTCACCCTC QACCTGCAQC 

CCAAGCTTGG GATCCACCAC CATCACCATC ACTAATAATG CATGGGCTGC AGCCAATTtSG 
^0 CACTGGCCGT CGTTTTACAA 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

{ii> MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 


141 


EP 1 164 203 A2 


Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:X4ar 

GTCACCr-nf^ ACCTGCAGCC CAAGCTTGGG ATCCACCACC ATCACCATCA CTAATAATGC 
^ ATGGGCTGCA GCCAATTGGC ACTGGCCGTC QTTTTACAA 

(2) INFORMATION FOR SEQ ID NO i 142; 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 15 base pairs 
<B) TYPE; nucleic acid 

(C) STRANDEDNESS j single 
<D} TOPOLOGY; unknown 

(ii) MOLECULE TYPEt cDNA 
fiiil HYPOTHETICAL; NO 
<iv) ANTISENSE: HO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:l42i 

TGTACGTCAC AACTA 15 

(2) IKPORMATION FOR SEQ ID NO: 143: 

{±) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 1€ base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; MO 
30 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

35 TGTACGTCAC AACTAC 16 

(2) INFORMATION FOR SEQ ID NO : 144 : 

(1) SEQUENCE CHARACTERISTICS t 
(A) LENGTH: 17 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
tD) TOPOLOGY: unknown 

(ii) MOLECm#E TYPE; cDNA 

(iii) HYPOTHETICAL: NO 
45 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID t JO H 4 : 

50 TGTACGTCAC AACTACA 17 

(2) INFORMATION FOR SEQ ID NO :14b. 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 18 base pairs 

{B) TTfPEr nucleic acid 

.. JO STRANDEDNESS : single 

(D) TOPOMGY: unknown 

(11) HOLECXJLE TYPE; CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB; HO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 
TOTACGTCAC AACTACAA 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
{€) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
{V) FRAGMENT TYPE: 
(vij ORIGINAL SOURCE: 

<5ci) SEQUENCE DESCRIPTION: SEQ ID NO: 146 
TOTACGTCAC AACTACAAT 

<2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY; unknovm 

(ii) MOLECULE TYPE; cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOil47 
TGTACQTCAC AACTACAATA 

(2) INFORMATION FOR SEQ ID N0il4S: 

(i) SEQUENCE CHARACTERISTICS J 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 

(D) TOPOLOGY: un3cnown 

(ii) MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 
_<T.:v) ORIGINAL SOURCE: 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14B: 
TGTACOTCAC AACTACAATA G 21 
(2) INFORMATION FOR SEQ ID NO; 145: 

10 

(i) SEQUENCE CHAJtACTERlSTICS ; 

(A) LENGTH: 22 base pairs 

(B> TYPE; nucleic acid 

(C) STRANDEDNESS: single 
--<D) TOPOLOGY: unknown 

15 

<ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(vj FRAGMENT TYPE; 
(vi3 ORIGINAL SOURCE; 

20 

(xi) SEQUENCE DESCaiPTION: SEQ ID NO: 149: 
TGTACGTCAC AACTACAATA GG 22 
(2) INFORMATION FOR SEQ ID NO:150 7 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 baa« pairs 
{B} TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

30 

Cii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAC34ENT TYPE: 
fvi) ORIGINAL SOURCE: 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 1 150: 
TGTACGTCAC AACTACAATA GGC 23 
(2) INFORMATION FOR SEQ ID NO: 151: 

40 

it) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE; 

ixi> SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
TGTACGTCAC AACTACAATA GGCC 24 
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(2) lOTORMATION FOU SEQ ID NO: 152: 

SEQUENCE CHARACTERISTICS: 
(A^ LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : single 
{D) TOPOLOGY: unknown 

{ii> MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) AKTISEN5E: NO 

(v) FRAGMENT TYPEi 

(vi) ORIGINAL SOURCE; 

-fxi) SEOUENCE DESCRIPTION: SEQ ID NO: 152 
TGTACGTCAC AACTACAATA GGCCC 

(2) INFORMATION FOR SEQ ID NO: IS 3: 

(i) SEQUENCE CKftRACTERlSTICS ; 

(A) LENGTH; 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKE5S : single 

(D) TOPOLOGY: unknown 

(ii3 MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 153 
TGTACGTCAC AACTACAATA GGCCCT 

f2) INFORMATION FOR SEQ ID NO r 154: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 27 base pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDHESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 
fiii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

Cv) FRAGMENT TYPE; 
Cvi) ORIGINAL SOURCE: 

Cxi} SEQUENCE DESCRIPTION: SEQ ID NO: 154 

TGTACGTCAC AACTACAATA GGCCCTG 

(2) INFORMATION FOR SEQ ID NO: 15 5 

{i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 28 base pairs 
(B) TYPE: nucleic acid 

CO STRANDEDNESS : single 
(Dl TOPOLOGY: unknown 
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(ii) MOLECULE TYPEr CDNA 
Ciii) HYPOTHETICAL: NO 
5 . f 'i v) AKTISENSE; NO 

(V) FRAGMENT TYPE: 
(vi] ORIGINAL SOURCE: 

{xi) SEQUENCE DESCRIPTION; SEQ ID NO; IS 5: 

^0 TGTACGTCAC AACTACAATA GGCCCTGC 2 a 

t2) INFORMATION FOR SEQ ID NO: 156: 

Ci) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 29 base pairs 
-(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLBCOLE TYPE; CDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 6; 
TGTACGTCAC AACTACAATA QQCCCTGCA 23 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
^° CC) STRANDEDNESS: single 

(D) TOPOLOGY? unknown 

(ii) MOLECULE TYPE: CDKA 
(tii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

{V) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO r 15 7: 
TGTACGTCAC AACTACAATA GGCCCTGCAC 3 0 

40 

(2> INFORMATION FOR SEQ ID HO: 158: 

(i) SEQUENCE CHARACTERISTICS: 
£A) LENGTH: 31 baso pairs 
<B) TYPE; nucleic acid 

<C) STRANDEDNESS: single 
{D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(vj FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 158: 
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20 


TGTACGTCAC AACTACAATA GGCCCTGCAC C 31 
J 2) INFORMATION FOR SEQ ID NO j 159; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
CD) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
(V) FRAGMENT TYPE: 

-tvi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 159: 
TGTACGTCAC AACTACAATA GGCCCTGCAC CA 32 
(2) INFORMATION FOR SEQ ID NO-.160: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

{ ii ) MOLECULE TYPE : . cDNA 

(iii) HYPOTHETICAL: NO 
{iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAG 33 
(2) INFORMATION FOR SEQ ID N0:1&1: 

35 

(ij SEQXreNCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
{C} STRAHDEDNESS: single 
<D) TOPOLOGY: unknown 

40 

iix) MOLECULE TYPE: cONA 
{iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO:161t 

TGTACGTCAC AACTACAATA GGCCCTGCAC CAGG 34 

(2) INFORMATION FOR SEQ ID NO: 16 J 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
<B) TYPE; nucleic acid 

55 
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iC) STRAKDEDKESS : single 
(D) TOPOtOGY: unknown 

^ (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) AMTISBNSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: EEQ ID WO; 162 1 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGC 35 
(2) INI?ORMATIOW FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
CO STHANCEDNSSS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 
(iii J HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:l€3: 

TGTACGTCAC AACTACAATA GGCCCTGCAC CAGQCC 36 

(2) INFORMATION FOR SEQ ID NO: 164: 

30 [i) SEQUENCE CHARACTERISTICS; 

<A) LENGTH; 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

35 (ii) fK>LECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE i NO 

(v) FRAGMENT TYPE; 
<vi) ORIGINAL SOURCE; 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCA 37 

(2) INFORMATION FOR SEQ ID NOrlSS: 

45 it] SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

so (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 
.. '"1> SEQUENCE DESCRIPTION: SEQ ID NO: 165 1 

5 

TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAG 3 8 

(2) INFORMATION FOR SEQ ID NO: 1^6; 

(i) SEQUENCE CHARACTERISTICS; 
10 {A> LENGTH: 39 ilistas pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY; unknown 

4ii) MOLECULE TYPE: cDNA 
15 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
iv) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166; 

20 

TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAGA 39 

(2) INFORMATION FOR SEQ ID NO:lfi7: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LKNQTH: IS base pairs 

CB) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(D} TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDKA 
30 fiii) HYPOTHETICAL: NO 

(iv) ANTISRNSS: NO 
(vj FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 16 7: 

OS 

CTGATGCGTC GGATCATCC 19 
(2) INFORMATION FOR SEQ ID NO; 166; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH! 20 base pairs 
<B) TYPE: nucleic acid 

{C) STRANDEDNESS: single 
(D> TOPOLOGY: txnknown 

(ii) MOLECULE TYPE: cDNA 
45 <iii) HYPOTHETICAL: NO 

Civ) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 8: 

50 

CTGATGCGTC GGATCATCCA 20 
(2) INFORMATION FOR SEQ ID NO : 169- 
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(i) SEQUENCE CHARACTERISTICS: 
<A} LKNGTH: 21 base pairs 

5 .;-3) TYPE: nucleic acid 

iC) STRANDEDKESS ; single 
[D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
fO (iv> ANTISEKSE: NO 

<v) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE; 

<xi> SEQtJEHCE DESCRIPTION: SEQ ID HO; 16 9: 
^5 CTGATGCGTC GGATCATCCA G 21 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 

20 (B) TYPE; nucleic acid 

(C) STRANDBDNE5S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 

2s tiv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
CTGATGCGTC GGATCATCCA GC 22 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNSSS; single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
^. (iv) AKTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
CTGATGCGTC GGATCATCCA GCA 23 
(2) INFORMATION FOR SEQ ID NO: 172: 

ti) SEQXJENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPE J nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
..^.v.) FRAGMENT TYPE: 

ivi) ORIOXKAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:X72 

CTGATGCGTC GGATCATCCA GCAG 

(2) INFORhEATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS ^ 
(AJ LENGTH: 25 base pairs 
tB) TYPE; nucleic acid 
CO STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

fii) MOLECULE TYPE: cDNA 
fxii) HYPOTHETICAL: NO 

(iv) ANTISENSE: UO 

(v) FRAGMENT TYPE; 

(vi) ORIGINTUj SOURCE: 

{xi) SEQXJEWCE DESCRIPTION: SEQ ID NO : 173 
CTGATGCGTC GGATCATCCA GCAGC 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: un>cnown 

Cii) MOLECULE TYPE? eWA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

CTGATGCGTC GGATCATCCA GCAGCA 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(R) TYPE: nucleic acid 
{C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

iii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l75: 
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CTGATGCGTC GGATCATCCA GCAGCAG 

^ (2) INFORMATION FOR SEQ ID NO: 176; 

Ci] SEQUENCE CHARACTElilSTICS : 
(A) I.SNGTH: 26 base pairs 
CB) TYPE; nucleic acid 
CO STRANDEDNESS : single 
<D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDWA 
(iiij KYPOTHETICAL : NO 
<iv) ANTISENSEi KO 

(v) FRAGMENT TYPE; 
(Yi) ORXGINAIi SOURCE: 

(Xij SEQUENCE DESCRIPTIONi SEQ ID NO:l76t 
CTGATOCGTC GGATCATCCA GCAGCAGC 

(2) INFORMATION FOR SEQ ID NO:177: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRA17DEDNE&S : single 

(D) TOPOIjOGY; miknown 

(ii) MOLECUl,E TYPE: cDNA 
Uii) HYPOTHETICAL: NO 
(iv) ANTISENSE: MO 
{v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 177: 
CTGATGCGTC GGATCATCCA GCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) x^GTH; 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS ; single 

(D) TOPOLOGY; unkiiown 

(ii) MOLECULE TYPE j cDNA 
Uii) HYPOTHETICAL; NO 
(iv) ANTISENSE: NO 

(V} FRAGMENT TYPE; 
{vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 178: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO : n 

(i) SEQUENCE CHARACTERISTICS: 
tA) LENGTH: 31 base pairs 
(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 


5 


MOLECULE TYPE: cDNA 
{iii> HYPOTHETICAL: NO 
{iv} ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 


10 


(xi) SBQUEKCE DESCRIPTION: SEQ ID NO: 179: 


CTGATGCGTC GGATCATCCA GCAGCAGCAG C 


31 


(2) INFORMATION FOR SEQ ID NO: 180; 


15 


fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY; unJcnown 


20 


(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) PRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 


25 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:lflO: 


CTGATGCGTC GGATCATCCA GCAGCAGCAG CA 

(2) INFORMATION FOR SEQ ID NO:l8l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDMA 

(iii) HYPOTHETICAL 5 NO 

(iv) ANTISEI^SE: HO 
<v) FRWaffiMT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAG 

(2) INFORMATION FOR SEQ ID NO : 132 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 ba^e pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE t 


55 
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(vi) ORIGINAL SOURCE: 
_ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

5 --■ 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGC 34 

(2 J IWFOWW^TIOH FOR SEQ ID NO: 183; 

(i) SEQUENCE CHARACTERISTICS: 
10 <A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNSSS : single 
<D) TOPOIjOGY^ unknown 

Cii) MOLECULE TYPE: cDNA 
15 (iii) HYPOTHETICAL t NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE t 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

£0 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCA 35 

i2) INFORMATION FOR SEQ ID N0;1S4; 

(i) SEQUENCE CHARACTERISTICS; 
25 (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Cii) MOLECUUS TYPE: cDNA 
30 (iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi> ORIGINAL SOURCE; 

<xi> SEQUENCE DESCRIPTION: SSQ ID NO: 184: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAG 26 
(2) INFORMATION FOR SEQ ID NO:185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
4S Uii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

50 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGC 37 
(2) INFORMATION FOR SEQ ID NO: 18 6: 

55 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE; nucleic acid 
-" (C) STRANDEDKESS: eingle 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTI SENSE: NO 

{V) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO^lSfi 

CTGATGCGTC GOATCATCCA GCAGCAGCAG CAGCAGCA 

<2) INFORMATION FOR SEQ ID NO: IS 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH; 39 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unjcnown 

(ii> MOLECULE TYPE: cDMA 
(iii> HYPOTHETICAL: NO 

(iv) - ANTISENSE: HO 

(v) FRAGMENT TYPE: 
{vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DBSCRIPTIOIT: SEQ ID NO: 187 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG 
(2) INFORMATION FOR SEQ ID NO:lfi8; 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 40 base pairs 
<B] TYPE: nucleic acid 

iC) STRANDEDNESS: single 
{D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICTU-i NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xij SEQUENCE DESCRIPTION: SEQ ID NO:L88: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG C 
(2) INFORMATION FOR SEQ ID NO: 18 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH ? 41 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 
fD) TOPOLOGY i unknown 

(ii> MOLECULE TYPE: CDHA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
^y)^ FRAGMENT TYPE r 
(Vi) ORIGINAL SOURCE: 

(Xi) SSQUENCE DESCRIPTION: SEQ ID NO: 18 9 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A 
(2) INFORMATION FOR SEQ ID NO; 190: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
-.(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknovm 

tiiJ MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
<iv} ANTISENSE: NO 
{v) FRAGMENT TYPE; 
(vi> ORIGINAL SOtJRCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO ,190 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AG 
(2) INFORMATION FOR SEQ ID NO:l9l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: vanknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:19X 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGT 
(2) INFORMATION FOR SEQ ID NO: 192: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(ivj ANTISENSE: NO 

{v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

<Xi) SEQUENCE DESCRIPTION; SEQ ID NO:L92; 
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CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTC 
. ._„.; < 2) INFORMATION FOR SEQ ID NO: 193 t 

fi) SBQtTEMCE CHARACTERISTICS; 

(A) LENGTH: 45 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDHESS ; single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEHSE: NO 

(v) FRAGMENT TYPE; 
_^vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION; SEC ID NO: 193: 

CTQATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCA 

{2} INFORMATION FOR SEQ ID NO: 194: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

<ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOtJRCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;194r 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAC 
(2) INFORMATION FOR SEQ ID NO; 195: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 4 7 baae pairs 
(B^ TYPE: nucleic acid 

iC) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACG 

(2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
<B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown. 

(^li) MOLECOLE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(viJ ORIGINAL SOURCE: 

{xi] SEQUENCE DESCRIPTION; SEQ ID NO:196: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGC 
(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNE55 : single 

(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
CTGATGCGTC GOATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCT 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISSNSE: NO 
<v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 198: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH; 51 base pairs 
<B) TYPE: nucleic acid 

to STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

^.vi) SEQUENCE DESCRIPTION: SEQ ID NO:l99; 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA A 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 52 bas« pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDKBSS : single 
{D) TOPOLOGY; unknown 

i±±) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA AC 
(2) INFORMATION FOR SEQ ID NO: 201; 

(i) SEQUENCE CHARACTERISTICS: 
(A3 LENGTH: 53 base pairs 

(B) TYPE^ nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unScnown 

(ii) MOLECULE TYPE: cDMA 
<iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

Cxi) SEQtJENCE DESCRIPTION; SEQ ID NO:20l: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACC 
(2) INFORMATION FOR SEQ ID NO; 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Cili) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

Cv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCQ 
(2) INFORMATION FOR SEQ ID NO:303j 
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Ci) SEQUENCE CHARACTERISTICS r 

(A) LENGTH; 55 base pairs 
H i.(B) TYPE: nucleic acid 
■ ' (C) STIIAWDEDKESS : single 

(D) TOPOIOGY: unknown 

(ii) MOLECULE TYPE^ cDKA 
(iil) HYPOTHETICAL: NO 
<iv) ANTISBNSE: NO 

(v) FRAGMENT TYPEj 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:203: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGA 
(2) INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE; nucleic acid 
<C) STRANtlEDNESS r Single 
<D) TOPOLOGY: un)tnown 

(ii) MOLECULE TYPE: CDNA 
Uii) HYPOTHETICAL: NO 
(iv) ANTXSHNSE: HO 

{v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:204; 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAA 
(2) INFORMATION FOR SEQ ID NO; 205: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 57 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; CENA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEN5E: NO 

(v) FRAGMENT TYPE: 
tvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 205: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAAT 
(2) INFORMATION FOR SEQ ID NO:206: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI5ENSE: KO 
(y) FRAGMENT TYPE: 

5 ^ (vi) ORIGINAL SOURCE: 

CxiJ SEQUENCE DESCRIPTION; SEQ ID NO;206: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATC 58 

10 

^2) INFORMATION FOR SEQ ID NO r 207: 
(iJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

CB) TYPE: nucleic acid 

IS (C) STRANDEDMESS : Single 

(D) TOPOLOGY: unJmown 

(ii> MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(iv) ANTISENSE; NO 
20 (v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 207: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCC 5 9 

25 

(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 60 base pairs 
(B> TYPE: nucleic acid 

30 iC) STRAHDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

35 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID W0i208: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 6 0 

40 

(2) INFORMATION FOR SEQ ID NO: 209: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS ; single 

(D) TOPOLOGY: urtknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
50 FRAGMENT TYPE; 

(Vi) ORIGINAL SOURCE: 

(xi) SEQtmrCE DESCRIPTION; SEQ ID NO: 2 09; 
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CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
T SI 

5 "(2) INFORMATIOH FOR SEQ XD NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 <D) TOPOLOGY: unknovm 

Cii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 
-(v) FRAGMENT TYPE: 
15 (vi) ORIGINAL SOTOCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 10: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 6 0 

(2) INFORMATION FOR SEQ ID NO: 211: 


20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE; nucleic acid 
25 iC) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; CDNA 
Ciii) HYPOTHETICAL; NO 
<iv) ANTT SENSE: HO 

30 Cv) FRAGMENT TYPE: 

(vi) ORIGINAL SOtmcE: 

(xi] SEQUENCE DESCRIPTION: SEQ ID N0:211: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
35 TGG €3 

(2) INFORMATION FOR SEQ ID NO:212: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 64 base pairs 

40 (B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL T NO 
45 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ZD NO ■ 2 1 2 : 

50 CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A^TTCACGCTA ACCGAATCCC 6 0 

TGGT 64 

(2) INFORMATION FOR SEQ ID NO : 2 1 3 
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(i) SEQUENCE CHAIIACTBRISTICS : 

(A) LENGTH: 65 base .pairs 
- , — ,-(B) TYPE: nucleic acid 

5 (C) STKANDEDHESS : single 

(D) TOPOI-OGY: unknown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
10 Cv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 213; 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
15 TGGTC 6 5 

(2) INFORMATION FOR SEQ ID NO: 2 14 J 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 66 base pairs 

£0 (BJ TYPE; nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: unJoiOvm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 
25 (iv) ANTISEHSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

txi) SEQUENCE DESCRIPTION: SEQ ID NO:2l4: 

30 CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 

TGGTCA 

(2) INFORMATION FOR SEQ ID NO: 215; 

ii) SEQUENCE CHARACTERISTICS: 
35 <A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii> MOLECULE TYPE; CDNA 
40 <iii} HYPOTHETICAL: NO 

(iv> ANTISEITSE: NO 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

45 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAO g 7 

(2) INFORMATION FOR SEQ ID NO: 216: 

50 Ci) SEQUENCE CHARACTERISTICS: 

<A) LENGTH; 66 base pairs 
(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 

'^1 MOLECULE TYPE; CDNA 
^ fiii) HYPOTHETICJUj ; NO 

(iv) ANTISEWSE: no 

(v) .FRAOMEOT TYPE; 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:216: 

10 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 

tggtcaga es 

{2) INFORMATION FOR SEQ ID NO: 2 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

^0 (ii) MOLECtTLE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGAT 6 9 

(2) INFORMATION FOR SEQ ID NO: 218: 

30 

U) SEQUENCE CHARACTERISTICS ^ 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY; unknown 

35 

(ii) MOI.ECULE TYPE; cDNA 

(iii) HYPOTHETICAL: MO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

40 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO 1218: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGATC 70 

45 (2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 
50 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii» HYPOTHETICAL: NO 
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(iv) ANTlSENSEr HO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 1 219: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC T 

(2) INFORMATION FOR SEQ ID NO; 22 0: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIjOGV; unknown 

(ii) MOLECDLE TYPE: CDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
tv> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC TT 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: X2 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY; unJcnown 

(ii) MOLECULE TYPE: cDNA 
CiiiJ HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(v> FRAGMEirr TYPE: 
(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
TGCACCTGAC TCC 

(2) INFORMATION FOR SEQ ID NO; 222: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 14 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 22: 
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TGCACGTGAC TCCT 14 
(2) INFORMATION FOR SEQ ID NO:223: 

5 ■ - ^ 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 15 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS ; single 
{D) TOPOLOGY: unknown 

10 

(ii) MOLECtnUE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

( v) FRAGMENT TYPE : 
(yi) ORIGINAL SOURCE; 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 
TGCACCTQAC TCCTG 15 
(2) INFORMATION FOR SEQ ID NO; 224: 

(j.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEOKESS: single 

(D) TOPOLOGY : unknown 

25 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE, NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
TGCACCTGAC TCCTGT 16 
(2) INFORMATION FOR SEQ ID NO:22S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
TGCACCTGAC TCCTGTG 17 
(2) INFORMATION FOR SEQ ID NO: 226; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDED^fESS : single 
(0) TOPOLOGY: unknown 

5 vxi) MOLECULE TYPE: cDNA 

(iii) HyPOTHETICAL: NO 

(iv) ANTISENSE: NO 
Cv) FRAGMENT TYPE; 
(Vi) ORIGINAL SOURCE: 

to (x±) SEQUENCE DESCRIPTION; SBQ ID NO: 226: 

TGCACCTGAC TCCTGTGG 

(2) INFORMATION FOR SEQ ID NO: 227: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

20 (ii) MOI*ECULE TYPEt cDNA 

(iii J HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

TGCACCTGAC TCCTGTGGA 19 

(2) INFORMATION FOR SEQ ID NO: 226: 

30 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 
{B) TYPE J nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: CDMA 

£iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
^Vi> ORIGINAL SOURCE; 

40 (Jti) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

TGCACCTGAC TCCTGTGGAG 20 

(2) INFORMATION FOR SBQ ID NO: 22 9: 

45 Ci> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: liiiknown 

so MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPEi 

55 
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(vi) ORIGINAL SOURCE: 

SEQUENCE DESCRIPTION; SEQ ID NO: 22 9 
TQCACCTGAC TCCTGTQGAG A 

{2} INFORMATION FOR SEQ ID NO: 23 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAC&1ENT TYPE; 
(vi) ORIGINAL SOURCE: 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO; 230 
TGCACCTGAC TCCTGTGGAG AA 

(2) INFORMATION FOR SEQ ID NO: 231: 
(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNHSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE ^ NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 
TGCACCTGAC TCCTGTGGAG AAG 

{2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: CDNA 
{iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 32: 
TGCACCTGAC TCCTGTGGAG AAGT 

(2) INFORMATION FOR SEQ ID NO:233: 
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(i) SEQUENCE CHAIIACTERISTICS : 
{A) LENGTH: 25 base pairs 
(B) TYPE; nucleic acid 

"(C) STRANDEDNESS ; single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDKTA 

(iii) HYPOTHETICAL! NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:233 
TGCACCTGAC TCCTGTGGAG AAQTC 

C2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 26 ba£^e pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS; single 
(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: CDNA 
CiiiJ HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) CEQUENCE DESCRIPTION: SEQ ID NO: 234 
TGCACCTGAC TCCTGTGGAG AAGTCT 

(2) INFORMATION FOR SEQ ID NO:235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRMMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 
TGCACCTGAC TCCTGTGGAG AAGTCtG 

(2) INFORMATION FOR SEQ ID NO:236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2B base pairs 

(B) TYPE; nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii> HYPOTHETICAL: HO 
(iv) AKTI SENSE: NO 
FRAGMEirr TYPE : 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 
TGCACCTGAC TCCTGTGGAG AAGTCTGC 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQOENCE CHARACTERISTICS i 
Ca) LENGTH; 29 base pairs 
(B) TYPE: nucleic acid 

'-(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
<V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 
TGCACCTGAC TCCTGTGGAG AAGTCTGCC 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
Uv) ANTISENSE: NO 

(V) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

{xi) SEQUENCE* DESCRIPTION: SEQ ID NO:23a; 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG 

(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic Acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDWA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOtJRCE ; 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 2 3?; 
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TGCACCTGAC TCCTGTGQAG AAGTCTGCCG T ■ 31 

(2) INFORMATION FOR SEQ ID NO:240: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) l^ENOTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 
<r) TOPOLOGY: unknown 

10 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL; HO 

(iv) ANTISENSE: NO 

(v) FRAGMEI7T TYPE: 
Ay±) ORIGINAL SOURCE: 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 
TGCACCTGAC TCCTGTGOAG AAGTCTGCCG TT 32 
(2) INFORMATION FOR SEQ ID NO: 241: 

20 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
tiv> ANTXSENSE: HO 

(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTA 33 
<2) INFORMATION FOR SEQ ID NO; 24 2: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nuclaie acid 

(C) STRANDEDNES5 : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTAC 34 
(2) INFORMATION FOR SEQ ID NO: 243: 

so 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 5 base pairs 

(B) TYPE: nucleic acid 
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fC) STRANDEDNESS r single 
(D) TOPOLOGY: unknown 

'Ui) MOLECXJLE TYPE: cDNA 

(iii) HYPOTHETICAL,: NO 

(iv) ANTISEWSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACT 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 36 base paix-s 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

fii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL; NO 
av> ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTG 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE; 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGC 

(2) INFORMATION FOR SEQ ID NO: 24 6; 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

. (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 46 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCC 

(2) INFORMATION FOR SEQ ID NO; 247: 

<i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 39 base pairs 
iB) TYPE: nucleic acid 
{C) 5TRANDEDNE5S : single 
(D) TOPOLOGY: unknown 

tii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEKSE: NO 
<V} FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCC 

(2) INFORMATION FOR SEQ ID NO: 248: 

fi) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANTEDNESS : single 

(D) TOrOLOGY: untaown 

(ii) MOLECXJLE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE I 

Cxi) SEQUENCE DESCRIPTION; SEQ ID NO; 24 8: 
TGCACCTGAC TCCTGTGOAG AAGTCTGCCG TTACTGCCCT 
(2) INFORMATION FOR SEQ ID NO: 24 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL; NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT G 
(2) INFORMATION FOR SEQ ID NO: 2 50: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 42 base pairs 

V 7'^) TYPE: nucleic acid 
5 To STRANDEDNESS r single 

C D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 
(iiij HYPOTHETICAL; NO 
(tv> ANTISENSE; NO 

10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEOUENCE DESCRIPTION: SEO XD NO: 250; 
TGCACCTGAG TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GT 42 

15 

(2 J INFORMATION FOR SEQ ID N0i2Sl: 

[i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unioiown 

Ui) MOLECULE TYPE: cDNA 
tiii) HYPOTHETICAL: NO 
(iv) ANTISENSE; NO 
25 fv) FRAGMENT TYPE: 

(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTG 43 

30 

(2) INFORMATION FOR SEQ ID NO; 2 52: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 
(BJ TYPE: nucleic acid 

35 CO STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 
tiv) ANTISENSE: NO 

4^ [v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO;252: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGG 44 
(2) INFORMATION FOR SEQ ID NO:253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE; nucleic acid 
so (C^ STRANDEDNESS! single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; cDNA 

55 
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(iiij HYPOTHETICAL; NO 
(iv) ANTISENSE: MO 
..,.^.'y) FRAGMENT TYPE: 
5 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253^ 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGG 4 5 

^0 (2) INFORMATION FOR SEQ ID NO:254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 46 base pairs 

(B) TYPE: nucleic acid 

- (C) STRANDEDNESS : single 
^5 (D> TOPOZ/X3Y: unknown 

(ii) MOIiECmE TYPE; cDHA 
{iii) HYPOTHETICAL: NO 
(iv) ANTISENSE; NO 

Cv) FRAGMENT TYPE: 
20 (vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGG 46 

25 {2) INFORMATION FOR SEQ ID NO:255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

35 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
TGCACCTGAC TCCTGTG<5AG AAGTCTTCKTCG TTACTGCCCT GTGGGQC 47 
40 (2) INFORMATION FOR SEQ ID NO:25S: 

(i) SEQUENCE (HIARACTEIIISTICS : 
(A) LENGTH: 48 b&se pairs 
CB) TYPE; nucleic acid 

CO STRANDEDNESS; Single 
45 (D) TOPOLOGY; xinknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

50 (vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO 2^6: 
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TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCA 48 
^,^^..J2) INFORMATION FOR SEQ ID NO: 257: 

^ (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 4 9 base pairs 

(B) TYPEi nucleic acid 

(C) 5TRAI«DEDNESS : single 
<D) TOPOMGY: iinltnown 

(ll) MOLECULE TYPE: c:DNA 
Ciii) HYPOTHETICAL: NO 

(iv) ANTISEHSE: NO 
{v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION! SEQ ID NO ; 2 57: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAA 4 9 

(2) INFORMATION FOR SEQ ID NO: 2 56; 

Ci) SEQUKNCE CHARACTERISTICS; 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
iiv) ANTISEHSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

^° (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 58: 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG 50 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

^° (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 2 59: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG G 51 
(2) INFORMATION FOR SEQ ID NO: 260: 

^° ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 5 2 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAWDEDNESS : single 
CD) TOPOLOGY: unknown 

5 \xi) MOLECULE TYPE: cDWA 

Ciii) HYPOTHETICAL r KG 
tiv) AirriSENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 260 : 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG GT 52 

{2) INFORMATION FOR SEQ ID N^0:261: 

15 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 209 base pairs 
<B) TYPE: nucleic acid 
<C) STRAlflDEDNESS: single 

(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: HO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE; 

25 SEQUENCE DESCRIPTION; SEQ ID NO;2ei: 

CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 60 
GACTCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC AAGGTGAACG TGGATGAA3T 120 
TGGTGGTGAG GCCCTGGGCA GGTTGGTATC AAGGTTACAA GACAGGTTTA AGQAGACCAA ISO 
TAGAAACTGG GCATGTGGAG ACAGAGAAG 209 

30 

{2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 88 base pairs 
{B) TYPE: nucleic acid 

35 <C> STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(±V) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

txi) SEQUENCE DESCRIPTION: SEQ ID NO; 262: 

TGAGACTCTQ TCTCAAAAAT AAATAAATAA ATAAATAAAT AAATAAATAA ATAAATAAAT 6 0 
AAATAAATAA GTAAAAAAGA AAGAATGC 68 

(2) INFORMATION FOR SEQ ID NO: 26 3: 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRAWDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
. CV'V ANTISENSE; NO 

5 <v) FRAC3MENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
GTQTGTGTGT GTGTGTGTT TTTTTTTAAC AGGGATTTGG GGAATTATTT GAGA 54 

10 

(2) INFORMATION FOR SEQ ID NO:264: 

Ci) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 24 base pairs 
-(B) TYPE: nucleic acid 
/5 (C) STRANDEDNESS ; a ingle 

(D) TOPOLOGY t urOcnown 

(ii) MOLECULE TYPE: cDKA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
20 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 
TTCCCCAAAT CCCTGTTAAA AAC 23 

25 

(2) INFORMATION FOR SEQ ID NO:2€5: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
3Q (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
33 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2e5: 
TTCCCCAAAT CCCTGTTAAA AAAAC 25 
(2) INFORMATION FOR SEQ ID NO;266: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 2 7 base pairs 

^5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 266: 
TTCCCCAAAT CCCTGTTAAA AAAftAAC 27 

5 

{2) INFORMATION FOR SEQ ID NO;267: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 103 base pairs 
{B) TYPE: nucleic acid 

W <C) STRANDEDNESS : single 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL! NO 
iiv) ANTISENSE: NO 

15 Cv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

{Xi) SEQUENCE DESCRIPTION: SEO ID NO:267r 

GTAAAACGAC CGCCAGTGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCGG 60 
20 GTACCGAGCT CGAATTCGTA ATCATGGTCA TAGCTGTTTC CTG 103 


<2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS T single 

(D) TOPOLOGY: unknovnn 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
{v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 8: 

GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGTQCA CCTGACTCCT GAGGAGNCTG 6 0 
GGCATGTGGA GACAGAGA 78 


(2) INFORMATION FOR SEQ ID NO: 269: 

Ci) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 78 base pairs 
(B> TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 9: 
TCTCTGTCTC CACATGCCCA GNCTCCTCAG GACTCAGGTG CACATGGTGT CTGTTTGAGG 6 0 
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25 


CATGGCGCAC CTGAGCTC 78 
(2) INFORMATION FOR SEQ ID NO: 270: 

^ {i) SEQUENCB CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: unknown 

.... 

fit) MOLECULE TYPE; cDMA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(v.i) ORIGINAL SOURCE; 

(Xi) SEQXJENCE DESCRIPTION: SEQ IDNO:270: 

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTOAG 60 
GCATGGCGCA CGTGACTC 76 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQXJENCE CHARACTERISTICS: 
CA) LENGTH: 82 base pairs 
(B) TYPE; nucleic acid 

CO 5THANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(Xi) SEQUENCE DESCRIPTION; SEQ IDNO:271: 

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 
GCATGGCGCA CCTGACTCCT GA 82 

^ (2) INFORMATION FOR SEQ ID NO: 272: 

{!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CG 42 
(2) INFORMATION FOR SEQ ID NO: 273: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LEKGTHi 13 base pairs 

(B) TYPE: nucl«ic acid 

(C) STRA1?DEDHESS: single 
5 CD) TOPOLOGY: unknown 

(li) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICJUj: NO 

(iv) AKTISENSE; NO 
iv} FRAGMEHT TYPE: 

10 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27a: 
CACCTQACTC CTA 13 
15 (2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDBDNESS : single 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
{^r) FRAGMENT TYPE; 

25 <vij ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:274: 
CACCTGACTC CTGGA 14 
30 (2} INFORMATION FOR SEQ ID NO: 2 75: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

40 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:275: 
CACCTGACTC CTGA 14 
(2) INFORMATION FOR SEQ ID NO: 2 76: 


4S 


SO 
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ii) SEQUENCE CHARACTERISTICS: 
(Al LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) AKTISENSE: NO 

(v) F31AGMENT TYPE: 
■'-'".) ORIGINAL SOURCE; 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27€; 
CCATGGTGTC TGTTTGAGGC ATGGCG 26 
(2) INFORMATION FOR SEQ ID N0:277; 

10 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 75 base pairs 

(B] TYPE: nucleic acid 

(C] STRANDEDNESS : single 
--(D) TOPOLOGY: unknown 

15 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 277: 

CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 
AGTCATCTTG GGGCT 75 

2s (2) INFORMATION FOR SEQ ID NO: 2 78: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 base pai.:s 
<B> TYPE: nucleic acid 

(O- STRANDEDNESS : single . 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOmCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27S: 

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 
C 61 

^° (2) INFORMATION FOR SEQ ID NO:279; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base paira 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 
tiv) AIOTISENSE: NO 

[v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO 2 7 * : 
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10 


15 


25 


GUCACUACAG GUGAGCUCCA 2 0 

(2^K. INFORMATION FOR SEQ ID NO ; 280: 

Ci) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 24 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: uiUcnown 

Cii) MOLECULE TYPE: CDNA 
Ciii) HYPOTHETICAL; NO 
Civ) ANTISENSEt NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOr280: 
CTCAGTCCAC GTGGTACCCT GCTG 24 
(2) INFORMATION FOR SEQ ID NO;231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: unJoiown 

(ii) MOLECULE TYPE: cIJNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

30 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 
GACTCCTGAG GAGAAGTCTG CCGTT 85 

35 

{2) INFORMATION FOR SEQ ID NO:282: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTHt 22 base pairs 

40 fB) TYPE; nucleic acid 

(C) STRANDEDNESS ; Single 

(D) TOPOLOGY: \mknown 

(ii) MOLECULE TYPE: cDNA 
<iii> HYPOTHETICAL: NO 

45 <iv) AHTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION; SEQ ID NO: 232; 
50 ACGGGTCCCG GAGTGGTGTC GC 22 

{2) INFORMATION FOR SEQ ID NO:2S3: 
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Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7S base pairs 
,,.-(B) TYPE: nucleic acid 

5 ' (C) STRANDEDNESS : single 

(D) TOPOLOGY: uftknovm 

tii) MOLECULE TYPE: cDNA 

[iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 
15 GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:2e4: 

fi) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 76 bas« pairs 
20 (B) TYPE: nucleic acid 

{C} STRANDEDNESS : Single 
(DJ TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
25 Uv) ANTISENSE: NO 

{v) FRAGMENT TYPE: 
tvi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 

30 ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGATTG 60 

GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO; 28 5; 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH; 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS i single 

(D) TOPOLOGY: unJoiown 

(ii) MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOXmCE: 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 55; 

''^ ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG €0 

GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:2e6: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : single 
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(D) TOPOLOGY: unknown 

Cii) MOLECUl^E TYPE; CDNA 
' HYPOTHETICAL: NO 

Civ) ANTISENSE: NO 

(v) FRAGMEOT TYPE: 

(vi) ORIGINAL SOXJRCE; 

(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 286: 

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 
GCATCAAGGT TACAAG 76 

C2} INFORMATION FOR SEQ ID NO:287: 

Xi) SEQUENCE CHARACTERISTICS: 
{A> LENGTH: 4 6 baa© pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS i single 
{D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDHA 
tiii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

25 

ACAGGTTTAA GGAGACCAAT AJGAAACTGGG CATGTGGAGA CAGAGAAG 43 
(2) INFORMATION FOR SEQ ID NO: 2 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDRA 
35 <iii) HYPOTHETICAL! NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOi2aa: 

40 

GACGACGACT GCTACCTGAC TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 28 9: 

(i) SEQU^ENCE CHARACTERISTICS;. 

(A) LENGTH: 24 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIjOGY: unknown 

(ii) MOX*ECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: WO 

(v) FRAGMENT TYPE: 
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10 


(yi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2S9: 

ACAGCGCACT GCTACCTGAC TCCA 24 
(2) INFORMATION FOR SSQ ID NO: 290; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

^5 (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOr290i 
TGGAGTCAGG TAGCAGTC IB 
(2) INFORMATION FOR SEQ ID NO: 2 91: 

Ci) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: €0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unlcnown 

(ii) MOLECULE TYPE: CDNA 
^° <iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 
Cvi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 91: 

35 

CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 

(2) INFORMATION FOR SEQ ID NO: 292: 

{i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: ^5 base pairs 
CB) TYPE; nucleic acid 
CO STRANDEDNESS: single 
CD> TOPOLOGY: unknown 

(ii) MOLECULE TYPEi cDKA 
Ciii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 2 92: 

50 

AGTCATCTTG GGGCTGTCGA GAGTAAAAGG TATGTCAGTC ATAGTTAAGA CCTTCTTAAA 
GGTCT 6 5 
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[2) INFORMATION FOR SEQ ID NO: 293; 

(i) SEQUENCE CHARACTERtSTICS : 
- (A) LENGTH: 25 base pairs 

(B) TYPEi nucleic acid 

(C) STRAHDEDNESS : single 

(D) TOPOiiOGY: xmJcnown 

(ii) MOLECULE TYPEi cDNA 

(iii) HYPOmBTICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Jxi) SEQUENCE DESCRIPTION: SEQ IP NO:293: 
GTAATTTCTA TCAGTAGAAC CCOGA 

(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: €0 ba$« pairs 

(B) TYPE: nucleic acid 

( C) STRAKDEDNESS t single 
CD) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 94; 
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 
(2) INFORMATION FOR SEQ ID NO;2&5: 

(il SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: HO 

(iv) ANTISENSE: NO 

(v) FRJiaMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:C95r 
AGTCATCTTG GGGCT 

(2) INFORMATION FOR SEQ ID NO;296 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) XiENGTH: 60 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPEi cDNA 

(iii) HYPOTHETICAL; NO 
v'.i-v}.- ANTI SENSE: NO 

5 (v)' FRAGMENT TYPE: 

(vi) ORIGINAL SOURCB; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 

fO 

(2) INFORMATION FOR SEQ ID NO:297i 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 

"(B) TYPE: nucleic acid 
fs iC) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEN5E: NO 
20 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
AGTCATCTTG GGGCTA 16 

25 

(3) INFORMATION FOR SEQ ID NO:25S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 
30 <C) STRANDEDNESS: single 

CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
05 Cv) FRAGMENT TVPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
CAGCTCTCAT TTTCCATACA TTAAAGATAG TCy^TCTTGGG OCT 43 

40 

(2) INFORMATION FOR SEQ ID NO: 299 r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE; nucleic acid 
^5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
50 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 99: 
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CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCTA 44 
(MISINFORMATION FOR SEQ ID NO: 300: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH^ 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: single 
CD) TOPOLOGY: unknown 

^° (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
{iv> ANTISENSE: NO 

<v) FRAGMENT TYPE: 
(yj,) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ XD NO; 300: 

CAGCTCTCAT TTTCCATACA GT 22 

(2) INFORMATION FOR SEQ ID NO: 301; 

^° (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: oDNA 
<tii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
{v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO:301: 
CAGCTCTCAT TTTCCATACA T 21 
<2) INFORMATION FOR SEQ ID NO: 302; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

Cii) MOLECULE TYPE: cENA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 
tvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
GCCTGGTACA CTGCCAGGCG CTTCTGCAGG TCATCGGCAT CGCGGAGGAG 50 
(2) INFORMATION FOR SEQ ID NO:303 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE; nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

5 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) AHTISENSE: NO 

(v) FRAGMEKT TYPE; 

(vi) ORIGINAL SOURCE: 

10 (xil SEQUENCE DESCRIPTION: SEQ ID :3O;303: 

GCCTGGTACA CTGCCAGGCA CTTCTGCAGG TCATCGGCAT CGCGaAGGAG 50 
<2) INFORMATION FOR SEQ ID NO: 3 04: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : Single 
<D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 04: 

GATGCCGATG ACCTGCAQAA G 21 

(2) INFORMATION FOR SEQ ID NO; 3 05; 

30 

(i) SEQUENCE CHARACTERISTICS; 
<A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C} STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
GATGCCGATG ACCTGCAGAA GC 22 
(2) INFORMATION FOR SEQ ID NO; 3 06: 

45 

(i> SEQUENCE CHARACTERISTICS; 
<A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

50 

(ii) MOl^ECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE:. 

(vi) ORIGINAL SOURCE: 

" "'xxi) SEQXreNCE DESCRIPTION: SEQ ID NO: 3 06 
GATGCCGATG ACCTGCAGAA GTQC 

(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 
{D) TOPOLOGY r tinkncvm 

iii) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 07 
GATOATCCGA CG 


{2} INFORMATION FOR SEQ ID NO: 308 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: \inknown 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 
CTGATGCGTC GQATCATC 

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE; nucleic acid 
(C> STRANDEDNESS: single 
(D> TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 
{iii> HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
tv) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 
GATGATCCGA CG 
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(2) INPOfiMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CKARACTERISTICS : 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: un)a30wn 

(ii) MOLECULE TYPE; CDNA 

(iii) HYK>THETICAL: NO 

(iv) ANTISENSE: WO 
(V) FRAGMENT TYPEi 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 

GGCGCGGACA TGGAGGACGT GTGCGGCCGC CTGGT 

(2) rMPORMATlOM FOR SEQ ID NO: 311: 

(±) SEQUENCE CHARACTERISTICS; 
<A) LENGTH: 34 base pairs 

(E) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGraNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 
TCCGCGATGC CGATGACCTG CAGAAGCGCC TGGC 

(2) INFORMATION FOR SEQ ID NO: 312: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTISENSEi NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 

CGGCTGCGAT CACCGTGCGG CACAGCT 

( 2 ) INFORMATION FOR SEQ ID NO : 3 1 i 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 27 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS; single 
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(D) TOPOLOGY: unknown 

.(ii) MOLECULE TYPE: cDNA 
5 (Hi) HYPOTHETICAL T NO 

(iv) ANTISENSE: MO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

txi) SEQUENCE DESCRIPTION: SEQ ID NOr3l3: 

10 

CGGCTGCGAT CACCGTGCGG T 21 

(2) INFORMATION FOR SEQ ID NO; 314: 

[L) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 27 base pairs 

<B) TYPE; nucleic acid 
to STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDHA 
20 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
(V) FRAjCTflENT TYPE: 
(vi) ORIGINAL SOURCE; 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:314: 

25 

CGGCTGCGAT CACCGTGCGG AACAGCT 2 7 

(2) INFORMATION ZFOR SEQ ID NO: 3 15: 

(i> SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 base pairs 

CB) TYPEr nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: xinknown 

(ii) MOliECULB TYPE: cDNA 
35 (iii) HYPOTHETICAL: WO 

(iv) ANTISENSE: HO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

40 

CGGCTGCGAT CACCGTGCGG CA 22 

(2) INFORMATION FOR SEQ ID NO; 316: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un3aiown 

fix) MOLECULE TYPE: cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 

55 
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(Xi> SEQUENCE DESCRIPTION: SEQ ID NO; 316: 
CGGCT<3ri?.>,'^- CACCGTGCGG TA 22 
(2) INPORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH T 21 bade pairs 
(BJ TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 
<iv) ANTISEKSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOXJRCE: 

(Xi) SEQUENCE DESCRIPTION; SEQ ID N0:317: 
CGGCTGCGAT CACCGTGCGG A 21 

20 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 42 base paira 

(B) TYPE: nucleic acid 
CO STHANDEr»IESS : single 
(D) TOPOLOGY; unknown 

<ii) IVtOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 
30 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOOTCB; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318; 
ATCATCAACT GGAAGATCAG GTCAGGAGCC ACTTGCCANC CT 42 

33 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 33 base pairs 
(B) TYPE: nucleic acid 

40 ic) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
ATCATCACAC TGGAAGACTC CAGGTCAGGA GCC 33 

50 

(2) INFORMATION FOR SEQ ID NO: 32 0: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 48 base paire 

(B) TYPE: nucleic acid 
_^(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 


(ii> MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
{iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
10 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 320: 

ATCCACTACA ACTACATGTG TAACAGTTGG wGC*aX3CC 


Claims 


1 . A process for determining the order of base specifically terminated nucleic acid fragments of a target nucleic acid 
20 molecule, comprising the steps of: 

a) obtaining a nucleic acid molecule, comprising the target nucleic acid sequence and, at one end, a tag; 

b) generating base specifically terminated nucleic acid fragments from the target nucleic acid; and 

c) analyzing the fragments by a mass spectrometry fonmat, thereby detennining the order of the base specif- 
^5 ically terminated nucleic acid fragments in the target nucleic acid molecule. 

2. A process of claim 1 , wherein In step b), a nuclease is contacted with the target nucleic acid to generate the base 
specifically terminated nucleic acid fragments. 

30 3. A process of claim 2, wherein the nuclease is a restriction enzyme that can recognize and cleave at least one 
restriction site in the target nucleic acid. 

4. A process of claim 2, wherein the target nucleic acid is a deoxyribonucleic acid and the nuclease is a deoxyribo- 
nuclease. 

35 

5. A process of claim 2, wherein the target nucleic acid is a ribonucleic acid and the nuclease Is a ribonuclease. 

6. A process of claim 5, wherein the ribonuclease is selected from the group consisting of: the G-specif ic ribonu- 
clease, the A-specific ribonuclease, the A/U specific PhyM ribonuclease, the U/C specific ribonuclease A, the 

40 C-specific chicken liver ribonuclease and crisavitin. 

7. A process of claim 1 , wherein in step b), the base specifically terminated nucleic acid fragments are generated by 
performance of a combined amplification and base-specific tenmination reaction. 

45 8. A process of claim 7, wherein the combined amplification and base-specific termination reaction is performed using 
a first polymerase, which has a relatively low affinity towards at least one chain tennninatlng nucleotide, and a 
second polymerase, which has a relatively high affinity towards at least one chain terminating nucleotide. 


9. A process of claim 8, wherein the first and second polymerases are thermostable DNA polymerases. 

50 

10. A process of claim 9, wherein the thermostable DNA polymerases are selected from the group consisting of: Taq 
DNA polymerase, AmpNTaq FS DNA polymerase, Deep Vent (exo-) DNA polymerase, Vent DNA polymerase, Vent 
(exo') DNApolymerase^ Vent DNA polymerase, Vent (exo') DNA polymerase, Deep Vent DNA polymerase, Thermo 
Sequenase, exo(-) Pseudococcus funosus (Pfu) DNA polymerase, AmpllTaq, Ultman, 9 degree Nm, Tth, Hot Tub, 

55 Pyrococcus funosus (Pfu) and F^rococcus woesei (Pwo) DNA polymerase. 


11 . A process of claim 1 , wherein the base specifically terminated nucleic acid fragments generated in step b) include 
mass modified nucleotides. 
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12. A process of claim 1 , wherein the tag comprises a 3' tag. 

13. A process of ciaim 1 , winerein tine tag comprises a 5' tag. 

14. A process of ciaim 12 or 13, wherein the tag is a non-naturai tag. 

15. A process of ciaim 14, wherein the non-natural tag is selected from the group consisting of: an affinity tag and a 
mass marlter. 

16. A process of claim 15, wherein the affinity tag facilitates immobiiization of the nucieic acid to a solid support. 

17. A process of claim 16, wherein the affinity tag is biotin or a nucieic acid sequence that is capable of binding to a 
capture nucleic acid sequence that is bound to a solid support. 

1 8. A method for detecting neoplasia/malagnancies in a tissue or celi sample, comprising detecting telomerase activity, 
mutation of a proto-oncogene, or expression of a tumor specific gene in the sample by detecting nucleic acids that 
encode the telomerase, that are specific for the mutation or that encode the tumor-specific by mass spectrometry. 

19. The method of ciaim 1 8 that is a method for detecting a neoplasla/malignancy in a tissue or cell sample, comprising: 

a) isolating telomerase from the sample and adding a synthetic DNA primer, which is optionally immobilized, 
complementary to a telomeric repeat, and all four deoxynucleotide triphosphates under conditions that result 
in telomerase specific extension of the synthetic DNA; 

b) amplifying the telomerase extended DNA product^ and 

c) detecting the DNA product by mass spectrometry, wherein telomerase-specific extension is Indicative of 
neoplasia/malignancy. 

20- The method of claim 19, wherein the primer contains a lini<er moiety for immobilization on a support; and the 
amplified primers are isolated by conjugating the linker portion to a solid support. 

21. A process for detecting telomerase activity in a biological sample, comprising the steps of: 

a) incubating the biological sample; a substrate primer, which can be extended by telomerase activity; and a 
complete set of deoxynucleoside triphosphates; and 

b) detecting a telomerase extended substrate primer by mass spectrometry, thereby detecting telomerase 
activity in the biological sample. 

22. The process of claim 21 , wherein the substrate pritner is immobilized to a solid support. 

23. The process of claim 22, wherein the substrate primer is immobilized in an array on the solid support. 

24. The process of claim 21 , further comprising amplifying the telomerase extended substrate primer prior to mass 
spectrometry. 

25. The method of claim 1 8 that is a method for identifying cells or tissues transformed by a mutant proto-oncogene, 
comprising: 

a) in a cell or tissue sample, amplifying a portion of a proto-oncogene that includes a codon indicative of 
transfonnation, wherein one primer comprises a lini<er moiety for immobilization; 

b) immobilizing DNA via the liniter moiety to a solid support, optionally in the form of an array; 

c) hybridizing a primer complementary to the proto-oncogene sequence that is upstream from the codon 

d) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP 
location; 

e) lonizing/volatizing the sample; and 

f) detecting the mass of an extended DNA indicative of a mutant proto-oncogene, thereby identifying cells or 
tissues transformed by a mutant proto-oncogene. 

26. The method of claim 25, wherein the proto-oncogene is the RET proto-oncogene. 
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27. The method of claim 26, wherein the codon indicative of transfomr>alion is codon 634 of the RET proto-oncogene. 

28. The method of claim 18 that Is a method for detecting expression of a tumor-specific gene, comprising: 

5 a) isoiating polyA RNA from the sample; 

b) preparing a cDNA ilbrary using reverse transcription; 

c) amplifying a cDNA product, or portion thereof, of the tumor-specific gene with a set of primers, wherein one 
primer of the set of primers comprises a iinl<er moiety; 

d) isolating the amplified product by immobilizing the DNA to a solid support via the linker moiety; 
^0 e) optionally conditioning the DNA; and 

f) ionizing/volatizing sample and detecting the presence of a DNA peak that is indicative of expression of the 
gene. 

29. The method of claim 28, wherein the cells are bone marrow cells, the gene Is the tyrosine hydroxylase gene, and 
^5 expression of the gene is indicative of neuroblastoma. 

30. A method for directly detecting a double-stranded nucleic acid using matrix-assisted laser desoiption/ionization 
time-of-flight (MALDl-TOF) mass spectrometry, comprising: 

^0 a) isolating a double-stranded DNA fragment from a cell or tissue sample; 

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA, 
wherein the conditions include one or both of the following: preparing samples for analysis at a temperature 
of about 4*C or less, and using a high concentration of DNA in the matrix to drive duplex fomnation; 

c) ionizing/votatizing the DNA of step b), using a low acceleration voltage; and 

d) detecting the presence of the double-stranded DNA by MALDI-TOF mass spectrometry. 

31 . A method for comparing DNA samples to discern relatedness or to detect mutations, comprising: 

a) obtaining a plurality of biological samples; 
30 b) amplifying a region of DNA from each sample that contains two or more microsatellite DNA repeat sequenc- 

es; 

c) detecting the presence of the amplified DNA from each sample by mass spectrometry and comparing the 
molecular weights of the amplified DNA, wherein different sizes are indicative of non-identity or mutations 
between or among the samples. 

35 

32. The method of claim 31 , wherein non-identity is Indicative of the presence of a mutation in the DNA in one sample, 
of non- relatedness, or of non-HLA compatibility between or among the individuals from whom the samples were 
obtained. 

40 33. The method of claim 31 or 32, wherein a plurality of markers are examined simultaneously 
34. A method for identifying a target nucleotide in a nucleic acid sequence, comprising: 

a) amplifying at least a portion of the nucleic acid sequence comprising the target nucleotide using; 

45 

(i) a first primer, wherein: 

the 5'-end of the primer shares identity to a portion of the nucleic acid sequence immediately down- 
stream from the target nucleotide followed by a sequence encoding a unique restriction endonuclease 
50 site, and 

the 3*-end of the primer is self -complementary; and 

(ii) a second downstream primer that contains a linker moiety, 

thereby producing an amplified double stranded nucleic acid comprising at least a portion of the 
55 nucJeic acid sequence comprising the target nucleotide; 

b) immobilizing the amplified double-stranded nucleic acid to a solid support via the [inker moiety; 

c) denaturing the immobilized nucleic acid and isolating the non-immobilized strand; 
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d) annealing the intracomplennentary sequences in the 3'-end of tiie isolated non-immobhized strand, such 
tiiat the 3'-end is extendable by a poiymerase, ttiereby producing a self-annealed nucleic acid; 
f) extending tlie self-annealed nucleic acid by incubation with a polynnerase, 3 dNTPs and 1 ddNTP, which 
corresponds to the missing dNTP, thereby producing an extended nucleic acid; 
5 g) cleaving the extended nucleic acid with a restriction endonuclease specific for the unique restriction endo- 

nuclease site; and 
h) identifying the target nucleotide. 

35. The method of claim 34, wherein identifying the target nucleotide is indicative of a mutation In the nucleic acid 
10 sequence. 

36. The method of claim 34, wherein the target nucleotide is identified based on the mass of the extended nucleic acid. 

37. The method of claim 35, wherein the mass of the extended nucleic acid is determined by mass spectrometry. 

15 

38. A method for detecting a target nucleic acid in a biological sample using RNA amplification, comprising: 

a) amplifying a target nucleic acid using a primer comprising a sequence that is complementary to the target 
sequence, or a complement thereof, and a sequence that encodes an RNA polymerase promoter; 
20 b) synthesizing RNA using an RNA polymerase that recognizes the promoter; and 

c) detecting the resulting RNA using mass spectrometry, thereby detecting the presence of the target nucleic 
acid sequence in the biological sample. 

39. A process of detecting the presence of a target nucleic acid sequence, comprising the steps of: 

25 

a) incubating in a reaction mixture 


i) an RNA polymerase, 

ii) nucleoside triphosphates, and 

30 iii) a nucleic acid molecule comprising the target nucleic acid sequence, or a complement thereof, and a 

promoter for the RNA polymerase, 

thereby producing an RNA molecule comprising the target nucleic acid sequence, or a complement 
thereof; and 

35 b) detecting the RNA molecule by mass spectrometry, thereby detecting the presence of the target nucleic 

acid sequence. 

40. The process of claim 39, wherein the nucleic acid molecule comprising the target nucleic acid sequence Is DNA 
and the RNA polymerase is a DNA dependent RNA polymerase. 


40 


41. The process of claim 40, further comprising, following step a) and prior to step b), 
inactivating the RNA polymerase; and 
digesting the DNA using RNAse-free DNAse I. 


^5 42. The process of claim 39, further comprising: 


hybridizing a detector oligonucleotide to the RNA molecule comprising the target nucleic acid sequence, 
wherein the detector oligonucleotide is complementary to a portion of the target nucleic acid sequence; and 
removing unhybrldized detector oligonucleotide; 

50 

wherein, in step b), the RNA molecule comprising the target nucleic acid sequence is detected by detecting 
hybridized detector oligonucleotide. 

43. A method for detecting the presence of a target nucleic acid sequence, comprising: 

55 

a) amplifying the target nucleic acid sequence using a primer comprising a sequence that is complementary 
to at least a portion of the target sequence, or a complement thereof, and a sequence that encodes an RNA 
polymerase promoter, thereby producing an amplified nucleic acid molecule comprising the target nucleic acid. 
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or complement thereof , and an RNA polymerase promoter; 

b) incubating the amplified nucleic acid molecule with nucleoside triphosphates and an RNA polymerase that 
recognizes the promoter, thereby producing an RNA corresponding to the target nucleic acid sequence; and 

c) detecting the RNA using mass spectrometry, thereby detecting the presence of the target nucleic acid se- 
5 quence. 

44. A process for detecting a target nucleic acid sequence present In a biological sample, comprising the steps of: 

a) obtaining a nucleic add molecule containing a target nucleic acid sequence from a biological sample; 

b) immobilizing the target nucleic acid sequence on a solid support vjathlol linkages, whereby thetarget nucleic 
acid sequence is present at a sufficient density to detect it using mass spectrometry; 

c) hybridizing a detector oligonucleotide with the target nucleic acid sequence; 

d) removing unhybridized detector oligonucleotide; 

e) ionizing and volatizing the product of step c); and 

^5 f) detecting the detector oligonucleotide by mass spectrometry, 

wherein detection of the detector oligonucleotide Indicates the presence of thetarget nucleic acid sequence in the 
biological sample. 

^0 45. The process of claim 44, wherein the target nucleic acid sequence is amplified prior to immobilization. 

46. The process of claim 44 or 45, wherein at least one of the detector oligonucleotide or the target nucleic acid 
sequence has been conditioned. 

25 47. A process of any of claims 44-46, wherein the solid support Is selected from the group consisting of: beads, flat 
surfaces, pins and combs. 

48- A process of any off claims 44-47, wherein the target nucleic acid Is immobilized In the form of an array. 
30 49. A process of any of claims 44-48, wherein the support is a silicon wafer. 

50. A process of any of claims 44-49. wherein the target nucleic acid sequence is amplified by an amplification pro- 
cedure selected from the group consisting of cloning, transcription, the polymerase chain reaction, the ligase chain 
reaction, and strand displacement amplification. 

35 

51. A process of any of claims 44-50, wherein the mass spectrometer is selected from the group consisting of: Matrix- 
Assisted Laser Desorption/lonization Time-of-Flight, Electrospray, Ion Cyclotron Resonance, and Fourier Trans- 
form. 

40 52. A process of any of claims 44-51 , wherein the sample is conditioned by mass differentiating at least two detector 
oligonucleotides or oligonucleotide mimetlcs to detect and distinguish at least two target nucleic acid sequences 
simultaneously. 

53. A process of claim 52, wherein the mass differentiation is achieved by differences In the length or sequence of the 
^5 at least two oligonucleotides. 

54. A process of claim 53, wherein the mass differentiation is achieved by the introduction of mass modifying func- 
tionalities in the base, sugar or phosphate moiety of the detector oligonucleotides. 

so 55. A process of claim 52, wherein the mass differentiation Is achieved by exchange of cations at the phosphodlester 
bond. 

56. A process of any of claims 44-55, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into DNA using mass modified dideoxynucleoslde triphosphates and DNA dependent DMA polymerase prior to 

ss mass spactrometric detection. 

57, A process of any of claims 44-56, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into RNA using mass modified ribonucleoside triphosphates and DNA dependent RNA polymerase prior to mass 


199 


EP 1 164 203 A2 


spectrometric detection. 

58. A process of any of claims 44-57, wheroin the target nucleic acid sequence is indicative of a disease or condition 
selected from the group consisting of a genetic disease, a ciironnosonnal abnormality, a genetic predisposition, a 
viral infection, a fungal infection and a bacterial infection. 

59. A process of any of claims 39-56, wherein the detector oligonucleotide is a peptide nucleic acid. 

60. A primer, comprising at least about 20, preferably about 1 6, bases of any of the sequence of nucleotides sequences 
setforth in SEQ ID NOs. 32-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128 and 129. 

61. A primer, comprising at least about 20, preferably about 1 6, bases of any of the sequence of nucleotides sequences 
setforth In SEQ ID NOs, 1-22, 24, and 27-32. 

62. A primer of claim 60 or 61 that is unlabeled, and optionally includes a mass modifying moiety, which is preferably 
attached to the 5'end. 

63. The method of any of claims 1-17and the process of any one of claims 1 8-59, wherein a nucleic acid is immobilized 
to a solid support via a selectively cleavable linker. 

64. The method or the process of claim 63, wherein the linker is thermocleavable, enzymatically cleavable, photo- 
cleavable or chemically cleavable. 

65. The method or the process of claim 63, wherein the linker is a trityl linker 

66. The method or the process of claim 65, wherein the linker is selected from the group consisting of 1-(2-nltro-5- 
(3-0-4,4'-dimethoxytritylpropoxy)phenyl)-1-0-((2-cyanoethoxy)-diisopropylamlnophosphlno)ethane and 1-(4- 
(3-0-4,4'-dimethoxytrltylpropoxy)-3-methoxy-6'nitrophenyl)-1-0-({2-cyanoethoxy)-dilsopropylamlnophosphino) 
ethane. 

67. The method or process of any of claims 1-59 and 63-66 wherein a primer is a peptide nucleic acid. 
66. The primer of claims 60-62 that is a peptide nucleic acid. 
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